Learning Diverse Skills for Local Navigation under Multi-constraint Optimality

Institute Homepage DE Sign In

Back

Autonomous Learning Robotics Conference Paper 2023

Autonomous Learning

Jin Cheng

Intern

Autonomous Learning

Marin Vlastelica Pogancic

Autonomous Learning

Pavel Kolev

Empirical Inference, Autonomous Learning

Georg Martius

Senior Research Scientist

Despite many successful applications of data-driven control in robotics, extracting meaningful diverse behaviors remains a challenge. Typically, task performance needs to be compromised in order to achieve diversity. In many scenarios, task requirements are specified as a multitude of reward terms, each requiring a different trade-off. In this work, we take a constrained optimization viewpoint on the quality-diversity trade-off and show that we can obtain diverse policies while imposing constraints on their value functions which are defined through distinct rewards. In line with previous work, further control of the diversity level can be achieved through an attract-repel reward term motivated by the Van der Waals force. We demonstrate the effectiveness of our method on a local navigation task where a quadruped robot needs to reach the target within a finite horizon. Finally, our trained policies transfer well to the real 12-DoF quadruped robot, Solo12, and exhibit diverse agile behaviors with successful obstacle traversal.

Author(s):	Cheng, Jin and Vlastelica, Marin and Kolev, Pavel and Li, Chenhao and Martius, Georg
Book Title:	Learning Diverse Skills for Local Navigation under Multi-constraint Optimality
Pages:	5083--5089
Year:	2023
Month:	October
Day:	03

Bibtex Type:	Conference Paper (inproceedings)

DOI:	10.1109/ICRA57147.2024.10611629
State:	Published
URL:	https://arxiv.org/abs/2310.02440

Links:	Website

BibTex

@inproceedings{Cheng2024:DOMINIC,
  title = {Learning Diverse Skills for Local Navigation under Multi-constraint Optimality},
  booktitle = {Learning Diverse Skills for Local Navigation under Multi-constraint Optimality},
  abstract = {Despite many successful applications of data-driven control in robotics, extracting meaningful diverse behaviors remains a challenge. Typically, task performance needs to be compromised in order to achieve diversity. In many scenarios, task requirements are specified as a multitude of reward terms, each requiring a different trade-off. In this work, we take a constrained optimization viewpoint on the quality-diversity trade-off and show that we can obtain diverse policies while imposing constraints on their value functions which are defined through distinct rewards. In line with previous work, further control of the diversity level can be achieved through an attract-repel reward term motivated by the Van der Waals force. We demonstrate the effectiveness of our method on a local navigation task where a quadruped robot needs to reach the target within a finite horizon. Finally, our trained policies transfer well to the real 12-DoF quadruped robot, Solo12, and exhibit diverse agile behaviors with successful obstacle traversal. },
  pages = {5083--5089},
  month = oct,
  year = {2023},
  slug = {cheng2024-dominic},
  author = {Cheng, Jin and Vlastelica, Marin and Kolev, Pavel and Li, Chenhao and Martius, Georg},
  url = {https://arxiv.org/abs/2310.02440},
  month_numeric = {10}
}

Research

Departments

Research Groups

People

Contact

Our Institute

Our History

Career

Doctoral Programs

Training

Service Units

Central Scientific Facilities

Workshops

Campus Services

Impact

Cooperation

Partners and Initiatives

Research

Departments

Research Groups

People

Contact

Our Institute

Our History

Career

Doctoral Programs

Training

Service Units

Central Scientific Facilities

Workshops

Campus Services

Impact

Cooperation

Partners and Initiatives

BibTex