Differential Geometry for Representation Learning

Figure, left to right: we use differential geometry to provide a better prior for VAEs, and to encode domain knowledge in generative models, both to improve interpretability and to learn robot motion skills. In addition, we develop computationally efficient methods for fitting statistical models and computing shortest paths on Riemannian data manifolds.

A common hypothesis in machine learning is that data lie near a low-dimensional manifold embedded in a high-dimensional ambient space. This implies that shortest paths between points should respect the underlying geometric structure. In practice, we capture the geometry of a data manifold through a Riemannian metric in the latent space of a stochastic generative model, which relies on meaningful uncertainty estimation for the generative process. The resulting distances are identifiable, since the length of the shortest path is invariant under re-parametrizations of the latent space. Consequently, we can study the learned latent representations beyond the classic Euclidean perspective. Our work is based on differential geometry, and we develop computational methods accordingly.
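As a minimal sketch of this idea, assume a deterministic decoder f from latent space to ambient space, so that the pullback metric is G(z) = J(z)^T J(z) with J the Jacobian of f. The paraboloid decoder, the sinh reparametrization, and all function names below are illustrative assumptions rather than the models used in this work, and the uncertainty term of a stochastic decoder is left out. The example measures the same curve in two latent parametrizations: the Euclidean latent lengths differ, while the Riemannian lengths agree up to discretization error.

```python
import numpy as np

def decoder(z):
    """Hypothetical decoder: lifts a 2-D latent point onto a paraboloid in R^3."""
    z = np.asarray(z, dtype=float)
    return np.array([z[0], z[1], z[0] ** 2 + z[1] ** 2])

def reparametrized_decoder(u):
    """Same surface, with latent coordinates warped by z = sinh(u)."""
    return decoder(np.sinh(u))

def jacobian(f, z, eps=1e-6):
    """Central finite-difference Jacobian of f at z, shape (D, d)."""
    z = np.asarray(z, dtype=float)
    columns = []
    for i in range(z.size):
        step = np.zeros_like(z)
        step[i] = eps
        columns.append((f(z + step) - f(z - step)) / (2.0 * eps))
    return np.stack(columns, axis=1)

def pullback_metric(f, z):
    """Riemannian metric induced in latent space: G(z) = J^T J."""
    J = jacobian(f, z)
    return J.T @ J

def riemannian_length(f, curve):
    """Length of a discretized latent curve (N, d), metric taken at segment midpoints."""
    total = 0.0
    for a, b in zip(curve[:-1], curve[1:]):
        v = b - a
        G = pullback_metric(f, 0.5 * (a + b))
        total += np.sqrt(v @ G @ v)
    return total

def euclidean_length(curve):
    return np.sum(np.linalg.norm(np.diff(curve, axis=0), axis=1))

# One curve, expressed in the original and in the warped latent coordinates.
t = np.linspace(0.0, 1.0, 200)[:, None]
curve_z = (1.0 - t) * np.array([-1.0, 0.0]) + t * np.array([1.0, 0.0])
curve_u = np.arcsinh(curve_z)

length_z = riemannian_length(decoder, curve_z)
length_u = riemannian_length(reparametrized_decoder, curve_u)
print("Euclidean latent lengths:", euclidean_length(curve_z), euclidean_length(curve_u))
print("Riemannian lengths:      ", length_z, length_u)
```

The finite-difference Jacobian keeps the sketch dependency-free; with a neural decoder one would normally obtain J by automatic differentiation instead.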

Geometric priors in latent space

Since the latent space can be characterized as non-Euclidean, we replace the standard Gaussian prior in Variational Auto-Encoders (VAEs) with a Riemannian Brownian motion prior, together with an efficient inference scheme. In particular, our prior is the heat kernel of a Brownian motion process, for which the normalization constant is trivial. We can also easily generate samples and back-propagate gradients using the re-parametrization trick [].
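The sampling step can be sketched as a discretized random walk, assuming a latent metric G(z) is available. Each step draws Gaussian noise with covariance dt * G(z)^{-1}, so a sample is a deterministic function of the mean, the time t, and standard normal noise, which is the structure the re-parametrization trick needs. The metric below, the function names, and the omission of the curvature-dependent drift term are simplifying assumptions for illustration; this is not the inference scheme of the paper.

```python
import numpy as np

def curved_metric(z):
    """Hypothetical latent metric: distances grow away from the origin."""
    return (1.0 + 4.0 * (z @ z)) * np.eye(z.size)

def flat_metric(z):
    """Euclidean metric, for comparison with a standard Gaussian."""
    return np.eye(z.size)

def sample_brownian_motion(mean, t, metric, rng, n_steps=20):
    """Random-walk approximation of Brownian motion run for time t from `mean`."""
    z = np.array(mean, dtype=float)
    dt = t / n_steps
    for _ in range(n_steps):
        # Cholesky factor of G(z)^{-1}, so the step has covariance dt * G(z)^{-1}.
        factor = np.linalg.cholesky(np.linalg.inv(metric(z)))
        z = z + np.sqrt(dt) * factor @ rng.standard_normal(z.size)
    return z

rng = np.random.default_rng(0)
mean = np.zeros(2)
curved = np.array([sample_brownian_motion(mean, 1.0, curved_metric, rng) for _ in range(1000)])
flat = np.array([sample_brownian_motion(mean, 1.0, flat_metric, rng) for _ in range(1000)])

curved_var = curved.var(axis=0).mean()
flat_var = flat.var(axis=0).mean()
print("per-dimension variance (curved, flat):", curved_var, flat_var)
```

With the flat metric the walk reduces to an isotropic Gaussian with variance t, while the curved metric slows the walk down where the metric is large, so the samples concentrate near the origin.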

Enriching the latent geometry

The ambient space of a generative model is typically assumed to be Euclidean. Instead, we propose to treat it as a Riemannian manifold, which lets us encode high-level domain knowledge through the associated metric. In this way, we can control the shortest paths and improve the interpretability of the learned representation. For instance, on the data manifold of human faces, an appropriate Riemannian metric in the ambient space can make the shortest path prefer the smiling class while it still moves optimally on the manifold [].
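To illustrate how an ambient metric can steer shortest paths, the sketch below pulls a metric M(x) on the ambient space back to the latent space as G(z) = J(z)^T M(f(z)) J(z). The paraboloid decoder, the Gaussian-bump cost around a hypothetical region to avoid, and all names and constants are assumptions chosen for illustration. Two fixed candidate curves with shared endpoints are compared: the direct one is shorter under the Euclidean ambient metric, while the detour becomes shorter once the ambient metric penalizes the region.

```python
import numpy as np

def decoder(z):
    """Hypothetical decoder: lifts a 2-D latent point onto a paraboloid in R^3."""
    return np.array([z[0], z[1], z[0] ** 2 + z[1] ** 2])

def decoder_jacobian(z):
    """Analytic Jacobian of the decoder above, shape (3, 2)."""
    return np.array([[1.0, 0.0], [0.0, 1.0], [2.0 * z[0], 2.0 * z[1]]])

def euclidean_ambient_metric(x):
    return np.eye(3)

def enriched_ambient_metric(x, centre=np.zeros(3), weight=24.0, scale=0.3):
    """Assumed cost: moving near `centre` in ambient space is expensive."""
    bump = np.exp(-np.sum((x - centre) ** 2) / (2.0 * scale ** 2))
    return (1.0 + weight * bump) * np.eye(3)

def latent_metric(z, ambient_metric):
    """Pull the ambient metric back through the decoder: G = J^T M J."""
    J = decoder_jacobian(z)
    return J.T @ ambient_metric(decoder(z)) @ J

def curve_length(curve, ambient_metric):
    """Length of a discretized latent curve, metric taken at segment midpoints."""
    total = 0.0
    for a, b in zip(curve[:-1], curve[1:]):
        v = b - a
        total += np.sqrt(v @ latent_metric(0.5 * (a + b), ambient_metric) @ v)
    return total

# Two latent curves from (-1, 0) to (1, 0): straight through the origin, or around it.
t = np.linspace(0.0, 1.0, 400)
direct = np.stack([2.0 * t - 1.0, np.zeros_like(t)], axis=1)
detour = np.stack([-np.cos(np.pi * t), -np.sin(np.pi * t)], axis=1)

lengths = {}
for name, metric in [("euclidean", euclidean_ambient_metric),
                     ("enriched", enriched_ambient_metric)]:
    lengths[name] = (curve_length(direct, metric), curve_length(detour, metric))
    print(name, "direct:", lengths[name][0], "detour:", lengths[name][1])
```

Only the ambient metric changes between the two runs; the decoder and the latent curves stay fixed, which is what makes this a way to inject domain knowledge without retraining the generative model.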

Probabilistic numerics on manifolds

In general, operations on Riemannian manifolds are computationally demanding, so we are interested in efficient approximate solutions. We use adaptive Bayesian quadrature to numerically compute integrals over normal laws on Riemannian manifolds. The basic idea is to combine prior knowledge with an active exploration scheme to reduce the number of costly evaluations required. In addition, we develop a fast and robust fixed-point iteration scheme for solving the system of ordinary differential equations (ODEs) that gives the shortest path between two points. The advantage of our approach over standard solvers is that we avoid the Jacobian of the ODE, which is ill-behaved for Riemannian manifolds learned from data [].
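The quadrature idea can be sketched in one dimension with vanilla, non-adaptive Bayesian quadrature: place a Gaussian process prior on the integrand, condition on a few evaluations, and integrate the posterior in closed form. This sketch assumes an RBF kernel and a standard normal integration measure on the real line, with fixed nodes in place of the active exploration scheme described above. The function names, length scale, and test integrand are illustrative assumptions, not the method of the paper.

```python
import numpy as np

def rbf_kernel(a, b, length_scale):
    """RBF kernel matrix between two sets of 1-D points."""
    diff = a[:, None] - b[None, :]
    return np.exp(-0.5 * diff ** 2 / length_scale ** 2)

def bayesian_quadrature(f, nodes, length_scale=1.0, jitter=1e-10):
    """Posterior mean and variance of the integral of f against N(0, 1)."""
    K = rbf_kernel(nodes, nodes, length_scale) + jitter * np.eye(nodes.size)
    l2 = length_scale ** 2
    # Closed-form kernel mean: integral of k(x, x_i) against N(x; 0, 1).
    kernel_mean = np.sqrt(l2 / (l2 + 1.0)) * np.exp(-0.5 * nodes ** 2 / (l2 + 1.0))
    # Closed-form double integral of the kernel against the measure.
    prior_variance = np.sqrt(l2 / (l2 + 2.0))
    weights = np.linalg.solve(K, kernel_mean)
    mean = weights @ f(nodes)
    variance = max(prior_variance - weights @ kernel_mean, 0.0)
    return mean, variance

nodes = np.linspace(-4.0, 4.0, 15)
estimate, variance = bayesian_quadrature(np.cos, nodes)
exact = np.exp(-0.5)  # E[cos(x)] for x ~ N(0, 1)
print("estimate:", estimate, "exact:", exact, "posterior variance:", variance)
```

The posterior variance is what an adaptive scheme would use to decide where to evaluate the integrand next; here it only reports how much uncertainty about the integral remains after the fixed evaluations.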

Robot motion skills

In robotic applications, a model that learns motion skills must generalize under dynamic changes of the environment in order to function in unstructured environments. For example, if an obstacle is introduced during the action, the robot should avoid it while still performing its task. We assume that human demonstrations span a data manifold on which shortest paths constitute natural motion skills. A robot can then plan movements along the associated shortest paths in the latent space of a VAE. Additionally, we can replace the Euclidean metric of the ambient space with a suitable Riemannian metric to account for dynamic obstacle avoidance (R:SS '21 best student paper award) [].

Members

Empirical Inference

Georgios Arvanitidis

Probabilistic Numerics, Empirical Inference

Philipp Hennig

Affiliated Researcher

Publications

Fröhlich, C., Gessner, A., Hennig, P., Schölkopf, B., Arvanitidis, G. Bayesian Quadrature on Riemannian Data Manifolds. Proceedings of the 38th International Conference on Machine Learning (ICML), Proceedings of Machine Learning Research, 139:3459-3468 (Editors: Marina Meila and Tong Zhang), PMLR, July 2021.

Beik-Mohammadi, H., Hauberg, S., Arvanitidis, G., Neumann, G., Rozo, L. Learning Riemannian Manifolds for Geodesic Motion Skills. Robotics: Science and Systems XVII (Editors: Dylan A. Shell, Marc Toussaint, and M. Ani Hsieh), Robotics: Science and Systems 2021 (RSS 2021), July 2021. Best student paper award.

Arvanitidis, G., Hauberg, S., Schölkopf, B. Geometrically Enriched Latent Spaces. Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS), Proceedings of Machine Learning Research, 130:631-639 (Editors: Arindam Banerjee and Kenji Fukumizu), PMLR, April 2021.

Kalatzis, D., Eklund, D., Arvanitidis, G., Hauberg, S. Variational Autoencoders with Riemannian Brownian Motion Priors. Proceedings of the 37th International Conference on Machine Learning (ICML), Proceedings of Machine Learning Research, 119:5053-5066 (Editors: Hal Daumé III and Aarti Singh), PMLR, July 2020.

Arvanitidis, G., Hauberg, S., Hennig, P., Schober, M. Fast and Robust Shortest Paths on Manifolds Learned from Data. Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS), 89:1506-1515 (Editors: Kamalika Chaudhuri and Masashi Sugiyama), PMLR, April 2019.