Shaping in Practice: Training Wheels to Learn Fast Hopping Directly in Hardware

Institute Homepage

Institute Homepage DE Sign In

Back

Dynamic Locomotion Conference Paper 2018

Intelligent Control Systems

Steve Heim

Postdoctoral Researcher

Dynamic Locomotion

Felix Ruppert

Doctoral Researcher

Dynamic Locomotion

Alborz Aghamaleki Sarvestani

Doctoral Researcher

Dynamic Locomotion, Haptic Intelligence

Alexander Badri-Spröwitz

Senior Research Scientist

Learning instead of designing robot controllers can greatly reduce engineering effort required, while also emphasizing robustness. Despite considerable progress in simulation, applying learning directly in hardware is still challenging, in part due to the necessity to explore potentially unstable parameters. We explore the of concept shaping the reward landscape with training wheels; temporary modifications of the physical hardware that facilitate learning. We demonstrate the concept with a robot leg mounted on a boom learning to hop fast. This proof of concept embodies typical challenges such as instability and contact, while being simple enough to empirically map out and visualize the reward landscape. Based on our results we propose three criteria for designing effective training wheels for learning in robotics.

Author(s):	Steve Heim and Felix Ruppert and Alborz Sarvestani and Alexander Sproewitz
Book Title:	Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) 2018
Pages:	5076-5081
Year:	2018
Month:	May
Publisher:	IEEE

Bibtex Type:	Conference Paper (inproceedings)

Event Name:	International Conference on Robotics and Automation
Event Place:	Brisbane, Australia
State:	Published
URL:	https://arxiv.org/abs/1709.10273

Electronic Archiving:	grant_archive
ISBN:	978-1-5386-3080-8/18

Links:	Video Youtube

BibTex

@inproceedings{shaping_icra2018,
  title = {Shaping in Practice: Training Wheels to Learn Fast Hopping Directly in Hardware},
  booktitle = {Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) 2018},
  abstract = {Learning instead of designing robot controllers can greatly reduce engineering effort required, while also emphasizing robustness. Despite considerable progress in simulation, applying learning directly in hardware is still challenging, in part due to the necessity to explore potentially unstable parameters. We explore the of concept shaping the reward landscape with training wheels; temporary modifications of the physical hardware that facilitate learning. We demonstrate the concept with a robot leg mounted on a boom learning to hop fast. This proof of concept embodies typical challenges such as instability and contact, while being simple enough to empirically map out and visualize the reward landscape. Based on our results we propose three criteria for designing effective training wheels for learning in robotics.},
  pages = {5076-5081},
  publisher = {IEEE},
  month = may,
  year = {2018},
  slug = {robot-hopping-learning-hardware},
  author = {Heim, Steve and Ruppert, Felix and Sarvestani, Alborz and Sproewitz, Alexander},
  url = {https://arxiv.org/abs/1709.10273},
  month_numeric = {5}
}

Research

Departments

Research Groups

People

Contact

Our Institute

Our History

Career

Doctoral Programs

Training

Service Units

Central Scientific Facilities

Workshops

Campus Services

Impact

Cooperation

Partners and Initiatives