Reinforcement Learning and Control
Model-based Reinforcement Learning and Planning
Object-centric Self-supervised Reinforcement Learning
Self-exploration of Behavior
Causal Reasoning in RL
Equation Learner for Extrapolation and Control
Intrinsically Motivated Hierarchical Learner
Regularity as Intrinsic Reward for Free Play
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation
Natural and Robust Walking from Generic Rewards
Goal-conditioned Offline Planning
Offline Diversity Under Imitation Constraints
Learning Diverse Skills for Local Navigation
Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations
Combinatorial Optimization as a Layer / Blackbox Differentiation
Symbolic Regression and Equation Learning
Representation Learning
Stepsize adaptation for stochastic optimization
Probabilistic Neural Networks
Learning with 3D rotations: A hitchhiker’s guide to SO(3)
Controller Learning using Bayesian Optimization

Autonomous systems such as humanoid robots are characterized by a multitude of feedback control loops operating at different hierarchical levels and time scales. Designing and tuning these controllers typically requires significant manual modeling and design effort as well as exhaustive experimental testing. To manage this growing complexity and move toward greater autonomy, it is desirable to develop intelligent algorithms that allow autonomous systems to learn from experimental data. In our research, we leverage automatic control theory, machine learning, and optimization to develop algorithms for automatic controller design and tuning.
In [], we propose a framework in which an initial controller is automatically improved based on the performance observed in a limited number of experiments. Entropy Search (ES) [] serves as the underlying Bayesian optimizer for the auto-tuning method. It represents the latent control objective as a Gaussian process (GP) (see the figure above) and sequentially suggests those controllers that are most informative about the location of the optimum. We validate the developed approaches on the experimental platforms at our institute (see figure).
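As a rough illustration of the auto-tuning loop (not the exact Entropy Search implementation), the sketch below fits a GP to previously evaluated controller parameters and their observed costs, then selects the next controller with an expected-improvement acquisition as a simpler stand-in for the ES criterion. The toy cost function, parameter ranges, and candidate sampling are illustrative assumptions.

```python
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, ConstantKernel

def run_experiment(theta):
    # Hypothetical stand-in for one experiment: apply controller gains `theta`
    # and measure a noisy scalar cost (on hardware this would be a rollout).
    return np.sum((theta - 0.3) ** 2) + 0.01 * np.random.randn()

def expected_improvement(mu, sigma, best):
    # Standard EI for minimization; a simpler surrogate for the ES acquisition.
    sigma = np.maximum(sigma, 1e-9)
    z = (best - mu) / sigma
    return (best - mu) * norm.cdf(z) + sigma * norm.pdf(z)

dim, n_init, n_iters = 2, 3, 15
rng = np.random.default_rng(0)
X = rng.uniform(0.0, 1.0, size=(n_init, dim))      # initial controllers
y = np.array([run_experiment(x) for x in X])        # observed costs

gp = GaussianProcessRegressor(kernel=ConstantKernel() * RBF(), normalize_y=True)

for _ in range(n_iters):
    gp.fit(X, y)                                     # GP model of the objective
    cand = rng.uniform(0.0, 1.0, size=(512, dim))    # candidate controllers
    mu, sigma = gp.predict(cand, return_std=True)
    theta_next = cand[np.argmax(expected_improvement(mu, sigma, y.min()))]
    X = np.vstack([X, theta_next])
    y = np.append(y, run_experiment(theta_next))

print("best controller found:", X[np.argmin(y)], "cost:", y.min())
```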
We have extended this framework in different directions to further improve data efficiency. When auto-tuning complex real systems (such as humanoid robots), simulations of the system dynamics are typically available. They provide less accurate information than real experiments, but at a lower cost. Under a limited experimental budget (i.e., total experimentation time), our work [] extends ES to include the simulator as an additional information source and to automatically trade off information gain against cost.
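The sketch below illustrates the underlying idea of weighing information against query cost when two sources are available; it is a simplified stand-in for the method in the paper, and the toy cost functions, per-query costs, and information-gain proxy are assumptions made purely for illustration.

```python
import numpy as np

# Illustrative two-fidelity setup: a cheap but biased simulator and an
# expensive, accurate hardware experiment. Costs are in arbitrary time units.
def query_sim(theta):  return np.sum((theta - 0.3) ** 2) + 0.05   # biased, cheap
def query_real(theta): return np.sum((theta - 0.3) ** 2)          # accurate, costly

SOURCES = {"sim":  {"cost": 1.0,  "query": query_sim},
           "real": {"cost": 20.0, "query": query_real}}

def info_gain(candidates, source):
    # Dummy proxy for the information value of a query; assume the simulator
    # is only half as informative as a real experiment.
    base = np.exp(-np.sum((candidates - 0.5) ** 2, axis=1))
    return base * (0.5 if source == "sim" else 1.0)

def pick_source_and_controller(candidates, budget_left):
    """Choose the (source, controller) pair with the best information per unit cost."""
    best = None
    for name, src in SOURCES.items():
        if src["cost"] > budget_left:
            continue                                  # source no longer affordable
        score = info_gain(candidates, name) / src["cost"]
        i = int(np.argmax(score))
        if best is None or score[i] > best[0]:
            best = (score[i], name, candidates[i])
    return best

cands = np.random.default_rng(1).uniform(0, 1, size=(256, 2))
print(pick_source_and_controller(cands, budget_left=60.0))
```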
The aforementioned auto-tuning methods model the performance objective using standard GP models, which are typically agnostic to the control problem. In [], the covariance function of the GP model is tailored to the control problem at hand by incorporating its mathematical structure into the kernel design. In this way, previously unseen values of the objective are predicted more accurately, which ultimately speeds up the convergence of the Bayesian optimizer.
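As a hedged illustration of structure-informed kernel design, the snippet below replaces a generic kernel on the raw controller gains with a linear kernel on hand-crafted, control-motivated features; the specific feature map is purely illustrative and is not the kernel derived in the paper.

```python
import numpy as np

def control_features(theta):
    # Illustrative feature map over PD-like gains; domain knowledge about how
    # the objective depends on the gains would determine the actual features.
    kp, kd = theta
    return np.array([1.0, kp, kd, kp * kd, kp ** 2, kd ** 2])

def structured_kernel(theta_a, theta_b, signal_var=1.0):
    # Linear (Bayesian-linear-regression) kernel in the problem-specific
    # features, instead of a generic RBF on the raw gains.
    return signal_var * control_features(theta_a) @ control_features(theta_b)

print(structured_kernel(np.array([0.5, 0.1]), np.array([0.4, 0.2])))
```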
Bayesian optimization provides a powerful framework for controller learning, which we have successfully applied in very different settings: humanoid robots [], micro robots [], and the automotive industry [].
Members
Publications