Bayesian Optimization

Institute Homepage

Institute Homepage Sign In

Back

Research Overview

Intrinsically Motivated Learning

Regularity as Intrinsic Reward for Free Play

SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models

Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation

Learning with Muscles

Natural and Robust Walking from Generic Rewards

The effect of muscles in Learning Behavior

Scaling RL to Large Musculoskeletal Systems

Reinforcement Learning for Diverse Solutions

Offline Diversity Under Imitation Constraints

Learning Diverse Skills for Local Navigation

Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

Reinforcement Learning and Control

Model-based Reinforcement Learning and Planning

Object-centric Self-supervised Reinforcement Learning

Self-exploration of Behavior

Causal Reasoning in RL

Equation Learner for Extrapolation and Control

Intrinsically Motivated Hierarchical Learner

Regularity as Intrinsic Reward for Free Play

Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation

Natural and Robust Walking from Generic Rewards

Goal-conditioned Offline Planning

Offline Diversity Under Imitation Constraints

Learning Diverse Skills for Local Navigation

Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

Deep Learning

Combinatorial Optimization as a Layer / Blackbox Differentiation

Object-centric Self-supervised Reinforcement Learning

Symbolic Regression and Equation Learning

Representation Learning

Stepsize adaptation for stochastic optimization

Probabilistic Neural Networks

Learning with 3D rotations: A hitchhiker’s guide to SO(3)

Haptic Sensing

Super-resolution Sensing for Haptics

Insight: a Haptic Sensor Powered by Vision and Machine Learning

Minsight: Learning-based tactile sensing for robotics

ML for Science

Predicting brain activity (fMRI)

Equation Learning for Statistical Physics

Machine Learning for Understanding Quantum Systems

Symbolic Regression and Equation Learning

Previous Research Projects

The Playful Machine

Robust and Affordable Haptic Sensation with Sparse Sensor Configuration

Probabilistic Numerics Members Publications

Bayesian Optimization

Bayesian Optimization is an increasingly popular approach to industrial and scientific prototyping problems. The basic premise in this setting is that one is looking for a location $x$ in some domain where a fitness function $f(x)$ is (globally) minimized. The additional, sometimes implicit, assumption is that individual evaluations of $f$ have comparably high computational or monetary cost (e.g. because they involve building a physical prototype, or running a robot for a few minutes). To avoid this high cost as much as possible, one thus builds a cheaper surrogate model for the true objective. If this model is probabilistic (i.e. it spreads probability mass over a space of possible true values of the objective) it can be used to reason about which physical experiments would be most useful to perform in pursuit of the true extremum.

Our contribution to this area is the development of a class of Bayesian optimization algorithms, known as Entropy Search [] that retain an explicit model for the location of a function's minimum, and reason about changes to this distribution effected by future experiments. Entropy Search is not a cheap method, but it provides a powerful representation in which one can reason about the information content of various kinds of experiments, and take decisions that take into account varying costs and quality of potential experiments.

In recent years, in collaboration with the Intelligent Control Systems group, the Entropy Search framework has been used to build advanced functionality for experimental design in automated machine learning and robotics. This includes the ability to simultaneously and efficiently use and trade off experimental channels of varying fidelity and cost [], and the effective use of strong analytical knowledge about a problem [].

Members

Probabilistic Numerics, Empirical Inference

Philipp Hennig

Affiliated Researcher

Probabilistic Numerics

Simon Bartels

Doctoral Researcher

Intelligent Control Systems

Alonso Marco Valle

Intelligent Control Systems

Sebastian Trimpe

Publications

Autonomous Motion Probabilistic Numerics Intelligent Control Systems Conference Paper On the Design of LQR Kernels for Efficient Controller Learning Marco, A., Hennig, P., Schaal, S., Trimpe, S. Proceedings of the 56th IEEE Annual Conference on Decision and Control (CDC), :5193-5200, IEEE, IEEE Conference on Decision and Control, December 2017 (Published) arXiv PDF On the Design of LQR Kernels for Efficient Controller Learning - CDC presentation DOI BibTeX

Autonomous Motion Probabilistic Numerics Intelligent Control Systems Conference Paper Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization Marco, A., Berkenkamp, F., Hennig, P., Schoellig, A. P., Krause, A., Schaal, S., Trimpe, S. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), :1557-1563, IEEE, Piscataway, NJ, USA, May 2017 (Published) PDF arXiv ICRA 2017 Spotlight presentation Virtual vs. Real - Video explanation DOI BibTeX

Probabilistic Numerics Conference Paper Fast Bayesian Optimization of Machine Learning Hyperparameters on Large Datasets Klein, A., Falkner, S., Bartels, S., Hennig, P., Hutter, F. Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), 54:528-536, Proceedings of Machine Learning Research, (Editors: Sign, Aarti and Zhu, Jerry), PMLR, April 2017 (Published) pdf URL BibTeX

Autonomous Motion Probabilistic Numerics Intelligent Control Systems Conference Paper Automatic LQR Tuning Based on Gaussian Process Global Optimization Marco, A., Hennig, P., Bohg, J., Schaal, S., Trimpe, S. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), :270-277, IEEE, IEEE International Conference on Robotics and Automation, May 2016 (Published) Video - Automatic LQR Tuning Based on Gaussian Process Global Optimization - ICRA 2016 Video - Automatic Controller Tuning on a Two-legged Robot PDF DOI BibTeX

Probabilistic Numerics Conference Paper Batch Bayesian Optimization via Local Penalization González, J., Dai, Z., Hennig, P., Lawrence, N. Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS), 51:648-657, JMLR Workshop and Conference Proceedings, (Editors: Gretton, A. and Robert, C. C.), May 2016 (Published) URL BibTeX

Autonomous Motion Empirical Inference Probabilistic Numerics Intelligent Control Systems Conference Paper Automatic LQR Tuning Based on Gaussian Process Optimization: Early Experimental Results Marco, A., Hennig, P., Bohg, J., Schaal, S., Trimpe, S. Machine Learning in Planning and Control of Robot Motion Workshop at the IEEE/RSJ International Conference on Intelligent Robots and Systems (iROS), Machine Learning in Planning and Control of Robot Motion Workshop, October 2015 (Published) PDF DOI BibTeX

Autonomous Motion Intelligent Control Systems Master Thesis Gaussian Process Optimization for Self-Tuning Control Marco, A. Polytechnic University of Catalonia (BarcelonaTech), October 2015 () PDF BibTeX

Empirical Inference Probabilistic Numerics Article Entropy Search for Information-Efficient Global Optimization Hennig, P., Schuler, C. Journal of Machine Learning Research, 13:1809-1837, -, June 2012 () PDF Web BibTeX