Empirical Inference Conference Paper 2009

Learning Complex Motions by Sequencing Simpler Motion Templates

Abstraction of complex, longer motor tasks into simpler elemental movements enables humans and animals to exhibit motor skills which have not yet been matched by robots. Humans intuitively decompose complex motions into smaller, simpler segments. For example when describing simple movements like drawing a triangle with a pen, we can easily name the basic steps of this movement. Surprisingly, such abstractions have rarely been used in artificial motor skill learning algorithms. These algorithms typically choose a new action (such as a torque or a force) at a very fast time-scale. As a result, both policy and temporal credit assignment problem become unnecessarily complex - often beyond the reach of current machine learning methods. We introduce a new framework for temporal abstractions in reinforcement learning (RL), i.e. RL with motion templates. We present a new algorithm for this framework which can learn high-quality policies by making only few abstract decisions.

Author(s): Neumann, G. and Maass, W. and Peters, J.
Book Title: ICML 2009
Journal: Proceedings of the 26th International Conference on Machine Learning (ICML 2009)
Pages: 753-760
Year: 2009
Month: June
Day: 0
Editors: Danyluk, A. , L. Bottou, M. Littman
Publisher: ACM Press
Bibtex Type: Conference Paper (inproceedings)
Address: New York, NY, USA
DOI: 10.1145/1553374.1553471
Event Name: 26th International Conference on Machine Learning
Event Place: Montreal, Canada
Digital: 0
Electronic Archiving: grant_archive
Language: en
Organization: Max-Planck-Gesellschaft
School: Biologische Kybernetik
Links:

BibTex

@inproceedings{5880,
  title = {Learning Complex Motions by Sequencing Simpler Motion Templates},
  journal = {Proceedings of the 26th International Conference on Machine Learning (ICML 2009)},
  booktitle = {ICML 2009},
  abstract = {Abstraction of complex, longer motor tasks
  into simpler elemental movements enables
  humans and animals to exhibit motor skills
  which have not yet been matched by robots.
  Humans intuitively decompose complex motions
  into smaller, simpler segments. For
  example when describing simple movements
  like drawing a triangle with a pen, we can
  easily name the basic steps of this movement.
  Surprisingly, such abstractions have rarely
  been used in artificial motor skill learning algorithms.
  These algorithms typically choose
  a new action (such as a torque or a force) at a
  very fast time-scale. As a result, both policy
  and temporal credit assignment problem become
  unnecessarily complex - often beyond
  the reach of current machine learning methods.
  We introduce a new framework for temporal
  abstractions in reinforcement learning (RL),
  i.e. RL with motion templates. We present a
  new algorithm for this framework which can
  learn high-quality policies by making only
  few abstract decisions.},
  pages = {753-760},
  editors = {Danyluk, A. , L. Bottou, M. Littman},
  publisher = {ACM Press},
  organization = {Max-Planck-Gesellschaft},
  school = {Biologische Kybernetik},
  address = {New York, NY, USA},
  month = jun,
  year = {2009},
  slug = {5880},
  author = {Neumann, G. and Maass, W. and Peters, J.},
  month_numeric = {6}
}