| |
Project Overview | Project Code: CIT 11 |
|---|---|
| Project name: | safe-control-gym+: A Unified sim2real Benchmark Pipeline for Safe Robot Learning and Control |
| TUM Department: | CIT - Electrical and Computer Engineering |
| TUM Chair / Institute: | TUEILSY Chair of Safety, Performance and Reliability for Learning Systems (Prof. Schoellig) |
| Research area: | Robotics, Machine Learning, System Control |
| Student background: | Computer ScienceComputer Science/ InformaticsElectrical EngineeringMechanical Engineering |
| Further disciplines: | |
Participation also possible online only: | |
Planned project location: | Theresienstr. 90, 80333 Munich, Germany |
Project Supervisor - Contact Details | |
|---|---|
| Title: | Prof. |
| Given name: | Angela |
| Family name: | Schoellig |
| E-mail: | angela.schoellig@tum.de |
| Phone: | +49 (89) 289 - 25336 |
Additional Project Supervisor - Contact Details | |
|---|---|
| Title: | Dr. |
| Given name: | SiQi |
| Family name: | Zhou |
| E-mail: | contact.lsy@xcit.tum.de |
| Phone: | +49 (89) 289 - 25334 |
Additional Project Supervisor - Contact Details | |
|---|---|
| Title: | |
| Given name: | Lukas |
| Family name: | Brunke |
| E-mail: | contact.lsy@xcit.tum.de |
| Phone: | +49 (89) 289 - 25334 |
Project Description | |
|---|---|
| Project description: | 1. Introduction and Background: Robots are playing an increasingly important role in our lives. Examples include, but are not limited to, autonomous driving, drone delivery, and service robots. In real-world deployments, robots are required to safely operate and learn despite uncertainties such as unknown road and weather conditions or unknown room layouts. Our recent survey paper [1] summarizes the advances in safe decision-making from both the controls and reinforcement learning communities. While algorithms proposed by both communities have shown ever-increasing capabilities of guaranteeing safety under uncertainties, the algorithms are rarely compared to each other on well-defined benchmark problems. To this end, we recently released a simulation benchmark suite, called "safe-control-gym" [2], to examine the performance, data efficiency, and safety of traditional model-based control, learning-based control, and safe reinforcement learning approaches. The benchmark suite currently includes two simulation environments (cart-pole and quadrotor), 12 control/learning-based decision-making algorithms, and a set of metrics for evaluating the algorithms' data efficiency, robustness, and performance. |
| Working hours per week planned: | 40 |
Prerequisites | |
|---|---|
Required study level minimum (at time of TUM PREP project start): | 2 years of bachelor studies completed |
| Subject related: | Robotics, control theory, dynamic systems, reinforcement learning, and programming (Python and C++) |
| Other: | Preferred experience in Robot Operating System (ROS), PyTorch, control or reinforcement learning algorithm implementations, and optimization |
