Inverse Optimal Control

Designing and tuning robotic behavior by hand-combining elementary objective terms is a tedious task, and it generally requires finding a proper representation for each new skill. Inverse Optimal Control (IOC) instead lets one specify a set of basis functions (or features) and learn the combination of objective terms that defines a policy imitating the expert's. The Inverse Reinforcement Learning (IRL) formulation, in which the generative model is formalized as a Markov Decision Process, was introduced in the early 2000s by Ng et al. and has since received considerable attention from the machine learning and robotics communities. At the Autonomous Motion Department, we seek to advance the state of the art in IOC/IRL, both to learn motion policies for robotic systems from demonstrations and to better understand and predict human motion. We are therefore interested in supporting a broad set of basis functions and in handling the high-dimensional continuous state-action spaces that arise in manipulation and motion generation.
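In the standard linear formulation (a generic sketch; the notation below is illustrative rather than taken from a specific paper of ours), the cost is a weighted sum of the features, and IOC seeks weights under which the demonstrated trajectories are (locally) optimal:

\[
c_w(x, u) = w^\top \phi(x, u), \qquad
\xi_{\mathrm{demo}} \approx \operatorname*{arg\,min}_{\xi} \; w^\top \Phi(\xi), \qquad
\Phi(\xi) = \sum_t \phi(x_t, u_t).
\]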
Our path integral IRL algorithm can handle such high-dimensional continuous state-action spaces, and it only requires local optimality of the demonstrated trajectories. We use regularization to achieve feature selection, and we propose an efficient algorithm for minimizing the resulting convex objective function. We have applied this approach to two core problems in robotic manipulation: learning a cost function for redundancy resolution in inverse kinematics, and learning a cost function over trajectories, which is then used in optimization-based motion planning for grasping and manipulation tasks.
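A minimal sketch of this kind of objective (illustrative only: the feature totals, sampling scheme, and regularization weight are assumptions, not our published implementation). The demonstration is scored against locally sampled trajectories with a softmax likelihood, and a nonnegativity constraint on the weights turns the L1 penalty into a smooth linear term, so the problem stays convex:

import numpy as np
from scipy.optimize import minimize

def neg_log_likelihood(w, phi_demo, phi_samples, lam):
    # phi_demo: (F,) feature totals of one demonstrated trajectory
    # phi_samples: (K, F) feature totals of K trajectories sampled
    # around the demonstration (the demonstration itself included)
    costs = phi_samples @ w                       # cost of each sample
    m = (-costs).max()                            # stable log-sum-exp
    log_z = m + np.log(np.exp(-costs - m).sum())
    # P(demo) = exp(-w.phi_demo) / sum_k exp(-w.phi_k); with w >= 0
    # the L1 penalty ||w||_1 reduces to w.sum(), keeping the
    # objective smooth and convex in w.
    return phi_demo @ w + log_z + lam * w.sum()

def fit_weights(phi_demo, phi_samples, lam=0.1):
    F = phi_demo.shape[0]
    res = minimize(neg_log_likelihood, np.ones(F),
                   args=(phi_demo, phi_samples, lam),
                   method="L-BFGS-B", bounds=[(0.0, None)] * F)
    return res.x  # sparse, nonnegative feature weights

The L1 term drives uninformative feature weights to zero, which is what yields feature selection.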
We are also interested in handling noisy and incomplete demonstrations. We have therefore built on recent work in direct loss minimization for structured prediction to suggest that IOC can, perhaps counterintuitively, be formalized as a form of policy search reinforcement learning. This connection lets the learner transition smoothly from imitating an expert to improving from its own experience over time, while remaining robust to noisy demonstrations. We use this procedure to calibrate our motion optimizers on our robot Apollo.
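A schematic of this view (the rollout and loss interfaces and the step sizes below are placeholders, not our actual optimizer): the cost weights act as policy parameters, rollouts come from the motion optimizer they induce, and a stochastic estimate of the loss gradient drives the update. Swapping the imitation loss for a task cost turns the same loop into self-improvement:

import numpy as np

def ioc_as_policy_search(rollout, loss, w0, sigma=0.05,
                         n_dirs=16, lr=0.1, iters=100):
    # rollout(w): trajectory produced by the motion optimizer under
    #             cost weights w (placeholder for the real planner)
    # loss(traj): imitation loss against a (possibly noisy) demo,
    #             or a task cost once the expert is no longer needed
    w = w0.copy()
    for _ in range(iters):
        base = loss(rollout(w))
        grad = np.zeros_like(w)
        for _ in range(n_dirs):
            eps = sigma * np.random.randn(*w.shape)
            # finite-difference estimate of the loss gradient in w
            grad += (loss(rollout(w + eps)) - base) * eps / sigma**2
        w -= lr * grad / n_dirs
    return w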
Finally, we are studying how IOC can be used to define an optimality principle for human behavior, based on human movements recorded with a motion capture system. The learned optimality criterion can then be used to predict human motion through trajectory optimization. We have demonstrated this ability on a collaborative manipulation task, in which two humans perform an assembly together, and we were able to predict human reaching motions even when there is significant interference between the two partners.
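A sketch of the prediction step (the feature function, horizon, and fixed endpoints are illustrative assumptions): once the weights w have been learned, a reaching motion is predicted by optimizing the interior waypoints of a discretized trajectory under the learned cost:

import numpy as np
from scipy.optimize import minimize

def predict_reaching_motion(w, features, x_start, x_goal, T=30):
    # features(traj): feature totals Phi(traj) of a (T, D) trajectory;
    #                 placeholder for the learned basis functions
    D = x_start.shape[0]

    def cost(z):
        traj = np.vstack([x_start, z.reshape(T - 2, D), x_goal])
        return w @ features(traj)   # learned optimality criterion

    # straight-line initialization between the fixed endpoints
    z0 = np.linspace(x_start, x_goal, T)[1:-1].ravel()
    res = minimize(cost, z0, method="L-BFGS-B")
    return np.vstack([x_start, res.x.reshape(T - 2, D), x_goal])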