CIT 11-Safe robot learning

	Back to TUM-PREP Projects overview

Project Overview	Project Code: CIT 11
Project name:	safe-control-gym+: A Unified sim2real Benchmark Pipeline for Safe Robot Learning and Control
TUM Department:	CIT - Electrical and Computer Engineering
TUM Chair / Institute:	TUEILSY Chair of Safety, Performance and Reliability for Learning Systems (Prof. Schoellig)
Research area:	Robotics, Machine Learning, System Control
Student background:	Computer ScienceComputer Science/ InformaticsElectrical EngineeringMechanical Engineering
Further disciplines:
Planned project location:	Theresienstr. 90, 80333 Munich, Germany

Project Supervisor - Contact Details
Title:	Prof.
Given name:	Angela
Family name:	Schoellig
E-mail:	angela.schoellig@tum.de
Phone:	+49 (89) 289 - 25336

Additional Project Supervisor - Contact Details
Title:	Dr.
Given name:	SiQi
Family name:	Zhou
E-mail:	siqi.zhou@tum.de
Phone:	+49 (89) 289 - 25334

Additional Project Supervisor - Contact Details
Title:
Given name:	Lukas
Family name:	Brunke
E-mail:	lukas.brunke@tum.de
Phone:	+49 (89) 289 - 25334

Project Description

Project description:

1. Introduction and Background: Robots are playing an increasingly important role in our lives. Examples include, but are not limited to, autonomous driving, drone delivery, and service robots. In real-world deployments, robots are required to safely operate and learn despite uncertainties such as unknown road and weather conditions or unknown room layouts. Our recent survey paper [1] summarizes the advances in safe decision-making from both the controls and reinforcement learning communities. While algorithms proposed by both communities have shown ever-increasing capabilities of guaranteeing safety under uncertainties, the algorithms are rarely compared to each other on well-defined benchmark problems. To this end, we recently released a simulation benchmark suite, called "safe-control-gym" [2], to examine the performance, data efficiency, and safety of traditional model-based control, learning-based control, and safe reinforcement learning approaches. The benchmark suite currently includes two simulation environments (cart-pole and quadrotor), 12 control/learning-based decision-making algorithms, and a set of metrics for evaluating the algorithms' data efficiency, robustness, and performance.

2. Project Goal and Expectations: The goal of this project is to expand the safe-control-gym and develop a pipeline for transferring the results to real-world experiments. The scope of the project includes adding a state estimation module to the benchmark suite, implementing additional simulation environments (e.g., autonomous cars, robot manipulators, and legged robots), identifying sim2real transfer gaps, and implementing an interface to facilitate the transfer from simulation to the real world. Possible tasks encompass a literature review on sim2real approaches and results in robotics, coding of simulation environments and control and reinforcement learning approaches in Python, and real-world robot experimentation and testing.

3. Outcome: The results of this project will be summarized and published in a robotics journal, and the code will be released as an extended component of the open-source "safe-control-gym" benchmark suite [2].

4. References:
[1] L. Brunke*, M. Greeff*, A. W. Hall*, Z. Yuan*, S. Zhou*, J. Panerati, and A. P. Schoellig. "Safe learning in robotics: From learning-based control to safe reinforcement learning." Annual Review of Control, Robotics, and Autonomous Systems 5 (2022): 411-444. Available at https://arxiv.org/pdf/2108.06266.pdf.

[2] Z. Yuan, A. W. Hall, S. Zhou, L. Brunke, M. Greeff, J. Panerati, and A. P. Schoellig. "safe-Control-Gym: A Unified Benchmark Suite for Safe Learning-Based Control and Reinforcement Learning in Robotics." IEEE Robotics and Automation Letters (2022). Available at https://arxiv.org/pdf/2109.06325.pdf. Code repository: https://github.com/utiasDSL/safe-control-gym.

Working hours per week planned:

40

Prerequisites
Required study level minimum (at time of TUM PREP project start):	2 years of bachelor studies completed
Subject related:	Robotics, control theory, dynamic systems, reinforcement learning, and programming (Python and C++)
Other:	Preferred experience in Robot Operating System (ROS), PyTorch, control or reinforcement learning algorithm implementations, and optimization

Seitenhierarchie

CIT 11-Safe robot learning

Back to TUM-PREP Projects overview

Project Overview

Project Code: CIT 11

Project Supervisor - Contact Details

Additional Project Supervisor - Contact Details

Additional Project Supervisor - Contact Details

Project Description

Prerequisites