DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems

Institute Homepage

Institute Homepage EN Sign In

Back

Autonomous Learning Empirische Inferenz Conference Paper 2023

Empirische Inferenz

Pierre Schumacher

Doctoral Researcher

Empirische Inferenz

Dieter Büchler

Research Group Leader

Empirische Inferenz, Autonomous Learning

Georg Martius

Senior Research Scientist

Muscle-actuated organisms are capable of learning an unparalleled diversity of dexterous movements despite their vast amount of muscles. Reinforcement learning (RL) on large musculoskeletal models, however, has not been able to show similar performance. We conjecture that ineffective exploration in large overactuated action spaces is a key problem. This is supported by our finding that common exploration noise strategies are inadequate in synthetic examples of overactuated systems. We identify differential extrinsic plasticity (DEP), a method from the domain of self-organization, as being able to induce state-space covering exploration within seconds of interaction. By integrating DEP into RL, we achieve fast learning of reaching and locomotion in musculoskeletal systems, outperforming current approaches in all considered tasks in sample efficiency and robustness.

Author(s):	Pierre Schumacher and Daniel F.B. Haeufle and Dieter Büchler and Syn Schmitt and Georg Martius
Book Title:	The Eleventh International Conference on Learning Representations (ICLR)
Year:	2023
Month:	May
Day:	1-5

Project(s):	Scaling RL to Large Musculoskeletal Systems
Bibtex Type:	Conference Paper (inproceedings)

Event Place:	Rwanda, Africa
State:	Published
URL:	https://openreview.net/forum?id=C-xa_D3oTj6

Electronic Archiving:	grant_archive
Talk Type:	Oral (notable-top-25%)

Links:	Arxiv pdf Website

BibTex

@inproceedings{schumacher2023:deprl,
  title = {DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems},
  booktitle = {The Eleventh International Conference on Learning Representations (ICLR)},
  abstract = {Muscle-actuated organisms are capable of learning an unparalleled diversity of
  dexterous movements despite their vast amount of muscles. Reinforcement learning (RL) on large musculoskeletal models, however, has not been able to show
  similar performance. We conjecture that ineffective exploration in large overactuated action spaces is a key problem. This is supported by our finding that common
  exploration noise strategies are inadequate in synthetic examples of overactuated
  systems. We identify differential extrinsic plasticity (DEP), a method from the
  domain of self-organization, as being able to induce state-space covering exploration within seconds of interaction. By integrating DEP into RL, we achieve fast
  learning of reaching and locomotion in musculoskeletal systems, outperforming
  current approaches in all considered tasks in sample efficiency and robustness.},
  month = may,
  year = {2023},
  slug = {schumacher2023-deprl},
  author = {Schumacher, Pierre and Haeufle, Daniel F.B. and B{\"u}chler, Dieter and Schmitt, Syn and Martius, Georg},
  url = {https://openreview.net/forum?id=C-xa_D3oTj6},
  month_numeric = {5}
}

Forschung

Abteilungen

Forschungsgruppen

Personen

Kontakt

Our Institute

Unsere Geschichte

Karriere

Überblick über Promotionsprogramme

Karriere

Service-Einrichtungen

Zentrale Wissenschaftliche Einrichtungen

Werkstätten

Campus Services

Impact

Kooperationen

Initiativen und Partner

Forschung

Abteilungen

Forschungsgruppen

Personen

Kontakt

Our Institute

Unsere Geschichte

Karriere

Überblick über Promotionsprogramme

Karriere

Service-Einrichtungen

Zentrale Wissenschaftliche Einrichtungen

Werkstätten

Campus Services

Impact

Kooperationen

Initiativen und Partner

BibTex