Probabilistic Inference for Fast Learning in Control

Institute Homepage DE Sign In

Empirical Inference Conference Paper 2008

We provide a novel framework for very fast model-based reinforcement learning in continuous state and action spaces. The framework requires probabilistic models that explicitly characterize their levels of confidence. Within this framework, we use flexible, non-parametric models to describe the world based on previously collected experience. We demonstrate learning on the cart-pole problem in a setting where we provide very limited prior knowledge about the task. Learning progresses rapidly, and a good policy is found after only a hand-full of iterations.

Author(s):	Rasmussen, CE. and Deisenroth, MP.
Book Title:	EWRL 2008
Journal:	Recent Advances in Reinforcement Learning: 8th European Workshop (EWRL 2008)
Pages:	229-242
Year:	2008
Month:	November
Day:	0
Editors:	Girgin, S. , M. Loth, R. Munos, P. Preux, D. Ryabko
Publisher:	Springer

Bibtex Type:	Conference Paper (inproceedings)

Address:	Berlin, Germany
DOI:	10.1007/978-3-540-89722-4_18
Event Name:	8th European Workshop on Reinforcement Learning
Event Place:	Villeneuve d‘Ascq, France

Digital:	0
Electronic Archiving:	grant_archive
Language:	en
Organization:	Max-Planck-Gesellschaft
School:	Biologische Kybernetik

Links:	PDF Web

BibTex

@inproceedings{5398,
  title = {Probabilistic Inference for Fast Learning in Control},
  journal = {Recent Advances in Reinforcement Learning: 8th European Workshop (EWRL 2008)},
  booktitle = {EWRL 2008},
  abstract = {We provide a novel framework for very fast model-based reinforcement learning in continuous state and action spaces. The framework requires probabilistic models that explicitly characterize their levels of confidence. Within this framework, we use flexible, non-parametric models to describe the world based on previously collected experience. We demonstrate learning on the cart-pole problem in a setting where we provide very limited prior knowledge about the task. Learning progresses rapidly, and a good policy is found after only a hand-full of iterations.},
  pages = {229-242},
  editors = {Girgin, S. , M. Loth, R. Munos, P. Preux, D. Ryabko},
  publisher = {Springer},
  organization = {Max-Planck-Gesellschaft},
  school = {Biologische Kybernetik},
  address = {Berlin, Germany},
  month = nov,
  year = {2008},
  slug = {5398},
  author = {Rasmussen, CE. and Deisenroth, MP.},
  month_numeric = {11}
}

Research

Departments

Research Groups

People

Contact

Our Institute

Our History

Career

Doctoral Programs

Training

Service Units

Central Scientific Facilities

Workshops

Campus Services

Impact

Cooperation

Partners and Initiatives