Causal Representation Learning

Institute Homepage

Institute Homepage Sign In

Back

Research Overview

Inferring and exploiting contact

Generative Proxemics: A Prior for 3D Social Interaction from Images

BITE -- Dog Shape and Pose from an Image

HOLD -- inferring 3D hand and object shape from video

MOVER -- Reconstructing 3D Scenes and People using Interaction

Datasets for understanding humans and animals

The Poses for Equine Research Dataset (PFERD)

BEAT2 Dataset for Holistic Co-Speech Gesture Generation

ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation

The BioAMASS Dataset

OpenCapBench dataset

Human health and the 3D body

Body Shape Models in Treating Anorexia Nervosa

Customized Bone Plants for Humerus Shaft Fractures

Reconstructing Signing Avatars From Video Using Linguistic Priors

The AI animator

HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles

Gaussian Garments

PuzzleAvatar: Assembling 3D Avatars from Personal Albums

FLARE: Fast Learning of Animatable and Relightable Mesh Avatars

Language, Vision, and World Models

AWOL: Analysis WithOut synthesis using Language

Re-Thinking Inverse Graphics with Large Language Models

TeCH: Text-guided Reconstruction of Clothed Humans

Human pose, shape, and motion capture

WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion

3D Human Pose Estimation via Intuitive Physics

Accurate 3D Body Shape Regression using Metric and Semantic Attributes

BEV

Generating human motion

Generating Human Interaction Motions in Scenes with Text Control

TEMOS: Generating Diverse Human Motions from Text

EMAGE: Full-body Gestures from Audio

TEACH: Temporal Action Compositions for 3D Humans

Robot Perception Group

AirCap: 3D Motion Capture

AirCap: Perception-Based Control

AirCapRL: Aerial Motion Capture Using Deep RL

Data Team

Lab Tours and Public Outreach

Collecting Data - From the Idea to the Publication

Capture Technologies Setup

Completed Projects

Human Pose, Shape and Action

3D Pose from Images

2D Pose from Images

Beyond Motion Capture

Action and Behavior

Body Perception

Body Applications

Pose and Motion Priors

Clothing Models (2011-2015)

Reflectance Filtering

Learning on Manifolds

Markerless Animal Motion Capture

Multi-Camera Capture

2D Pose from Optical Flow

Body Perception

Neural Prosthetics and Decoding

Part-based Body Models

Intrinsic Depth

Lie Bodies

Layers, Time and Segmentation

Understanding Action Recognition (JHMDB)

Intrinsic Video

Intrinsic Images

Action Recognition with Tracking

Neural Control of Grasping

Flowing Puppets

Faces

Deformable Structures

Model-based Anthropometry

Modeling 3D Human Breathing

Optical flow in the LGN

FlowCap

Smooth Loops from Unconstrained Video

PCA Flow

Efficient and Scalable Inference

Motion Blur in Layers

Facade Segmentation

Smooth Metric Learning

Robust PCA

3D Recognition

Object Detection

Empirische Inferenz Members Publications

Causal Representation Learning

Causal rep combined — (a) Causal representation learning aims to infer abstract, high-level causal variables and their relations from low-level perceptual data such as images or other sensor measurements []. Recent work in this direction includes: (b) a proof that self-supervised learning isolates the invariant (content) representation c that is shared across views (e.g., obtained via data augmentation) []; (c) a method for extracting causal structure from trained deep generative models that allows for interventions leading to novel "hybrid" data []; and (d) a new instantiation of the principle of independent mechanisms suitable for unsupervised representation learning [].

Causal representation learning aims to move from statistical representations towards learning causal world models that support notions of intervention and planning, see Fig. (a) [].

Coarse-grained causal models Defining objects that are related by causal models typically amounts to appropriate coarse-graining of more detailed models of the world (e.g., physical models). Subject to appropriate conditions, causal models can arise, e.g., from coarse-graining of microscopic structural equation models [], ordinary differential equations [], temporally aggregated time series [], or temporal abstractions of recurrent dynamical models []. Although models in economics, medicine, or psychology typically involve variables that are abstractions of more elementary concepts, it is unclear when such coarse-grained variables admit causal models with well-defined interventions; [] provides some sufficient conditions.

Disentanglement A special case of causal representation learning is disentanglement, or nonlinear ICA, where the latent variables are assumed to be statistically independent. Through theoretical and large-scale empirical study, we have shown that disentanglement is not possible in a purely unsupervised setting [] (ICML'19 best paper). Follow-up works considered a semi-supervised setting [], and showed that disentanglement methods learn dependent latents when trained on correlated data [].

Multi-view learning Learning with multiple views of the data allows for overcoming the impossibility of purely-unsupervised representation learning, as demonstrated through identifiability results for multi-view nonlinear ICA [] and weakly-supervised disentanglement []. This idea also helps explain the impressive empirical success of self-supervised learning with data augmentations: we prove that the latter isolates the invariant part of the representation that is shared across views under arbitrary latent dependence, see Fig. (b) [].

Learning independent mechanisms For image recognition, we showed (by competitive training of expert modules) that independent mechanisms can transfer information across different datasets []. In an extension to dynamic systems, learning sparsely communicating, recurrent independent mechanisms (RIMs) led to improved generalization and strong performance on RL tasks []. Similar ideas have been useful for learning object-centric representations and causal generative scene models [].

Extracting causal structure from deep generative models We have devised methods for analysing deep generative models through a causal lens, e.g., for better extrapolation [] or creating hybridized counterfactual images, see Fig. (c) []. Causal ideas have also led to a new structured decoder architecture [] and new forms of gradient combination to avoid learning spurious correlations [].

New notions of non-statistical independence To use the principle of independent causal mechanisms as a learning signal, we have proposed two new notions of non-statistical independence: a general group-invariance framework that unifies several previous approaches [], and an orthogonality condition between partial derivatives tailored specifically for unsupervised representation learning, see Fig. (d) [].

Members

Empirische Inferenz

Julius von Kügelgen

Doctoral Researcher

Empirische Inferenz

Luigi Gresele

Doctoral Researcher

Empirische Inferenz

Michel Besserve

Senior Research Scientist

Empirische Inferenz

Felix Leeb

Doctoral Researcher

Empirische Inferenz

Bernhard Schölkopf

Director

Doctoral Researcher

Empirische Inferenz

Anirudh Goyal

Empirische Inferenz

Giambattista Parascandolo

Doctoral Researcher

Empirische Inferenz

Paul Rubenstein

Doctoral Researcher

Empirische Inferenz

Dominik Janzing

Research Scientist

Publications

Empirical Inference Conference Paper Unsupervised Object Learning via Common Fate Tangemann, M., Schneider, S., von Kügelgen, J., Locatello, F., Gehler, P., Brox, T., Kümmerer, M., Bethge, M., Schölkopf, B. Proceedings of the Second Conference on Causal Learning and Reasoning (CLeaR), 213:281-327, Proceedings of Machine Learning Research, (Editors: van der Schaar, Mihaela and Zhang, Cheng and Janzing, Dominik), PMLR, April 2023 (Published) arXiv URL BibTeX

Empirical Inference Conference Paper Independent mechanisms analysis, a new concept? Gresele*, L., von Kügelgen*, J., Stimper, V., Schölkopf, B., Besserve, M. Advances in Neural Information Processing Systems 34 (NeurIPS 2021), :28233-28248, (Editors: M. Ranzato and A. Beygelzimer and Y. Dauphin and P.S. Liang and J. Wortman Vaughan), Curran Associates, Inc., 35th Annual Conference on Neural Information Processing Systems, December 2021, *equal contribution (Published) arXiv URL BibTeX

Empirical Inference Conference Paper Self-supervised learning with data augmentations provably isolates content from style von Kügelgen*, J., Sharma*, Y., Gresele*, L., Brendel, W., Schölkopf, B., Besserve, M., Locatello, F. Advances in Neural Information Processing Systems 34 (NeurIPS 2021), :16451-16467, (Editors: M. Ranzato and A. Beygelzimer and Y. Dauphin and P.S. Liang and J. Wortman Vaughan), Curran Associates, Inc., 35th Annual Conference on Neural Information Processing Systems, December 2021, *equal contribution (Published) arXiv URL BibTeX

Empirical Inference Conference Paper Function Contrastive Learning of Transferable Meta-Representations Gondal, M. W., Joshi, S., Rahaman, N., Bauer, S., Wüthrich, M., Schölkopf, B. Proceedings of 38th International Conference on Machine Learning (ICML), 139:3755-3765, Proceedings of Machine Learning Research, (Editors: Meila, Marina and Zhang, Tong), PMLR, July 2021 (Published) URL BibTeX

Empirical Inference Conference Paper On Disentangled Representations Learned From Correlated Data Träuble, F., Creager, E., Kilbertus, N., Locatello, F., Dittadi, A., Goyal, A., Schölkopf, B., Bauer, S. Proceedings of 38th International Conference on Machine Learning (ICML), 139:10401-10412, Proceedings of Machine Learning Research, (Editors: Meila, Marina and Zhang, Tong), PMLR, July 2021 (Published) URL BibTeX

Empirical Inference Conference Paper Learning explanations that are hard to vary Parascandolo*, G., Neitz*, A., Orvieto, A., Gresele, L., Schölkopf, B. In 9th International Conference on Learning Representations (ICLR), May 2021, *equal contribution (Published) arXiv URL BibTeX

Empirical Inference Conference Paper Recurrent Independent Mechanisms Goyal, A., Lamb, A., Hoffmann, J., Sodhani, S., Levine, S., Bengio, Y., Schölkopf, B. In The Ninth International Conference on Learning Representations (ICLR), 9th International Conference on Learning Representations (ICLR 2021), May 2021 (Published) URL BibTeX

Empirical Inference Conference Paper A Theory of Independent Mechanisms for Extrapolation in Generative Models Besserve, M., Sun, R., Janzing, D., Schölkopf, B. In Proceedings of the 35th AAAI Conference on Artificial Intelligence , 35(8):6741-6749, 35th AAAI Conference on Artificial Intelligence (AAAI 2021), February 2021 (Published) arXiv DOI URL BibTeX

Empirical Inference Article Toward Causal Representation Learning Schölkopf*, B., Locatello*, F., Bauer, S., Ke, N. R., Kalchbrenner, N., Goyal, A., Bengio, Y. Proceedings of the IEEE, 109(5):612-634, 2021, *equal contribution (Published) DOI URL BibTeX

Empirical Inference Conference Paper Object-Centric Learning with Slot Attention Locatello, F., Weissenborn, D., Unterthiner, T., Mahendran, A., Heigold, G., Uszkoreit, J., Dosovitskiy, A., Kipf, T. Advances in Neural Information Processing Systems 33 (NeurIPS 2020), :11525-11538, (Editors: H. Larochelle and M. Ranzato and R. Hadsell and M. F. Balcan and H. Lin), Curran Associates, Inc., 34th Annual Conference on Neural Information Processing Systems, December 2020 (Published) URL BibTeX

Empirical Inference Conference Paper Weakly-Supervised Disentanglement Without Compromises Locatello, F., Poole, B., Rätsch, G., Schölkopf, B., Bachem, O., Tschannen, M. Proceedings of the 37th International Conference on Machine Learning (ICML), 119:6348-6359, Proceedings of Machine Learning Research, (Editors: Hal Daumé III and Aarti Singh), PMLR, July 2020 (Published) URL BibTeX

Empirical Inference Conference Paper Counterfactuals uncover the modular structure of deep generative models Besserve, M., Mehrjou, A., Sun, R., Schölkopf, B. 8th International Conference on Learning Representations (ICLR), April 2020 (Published) URL BibTeX

Empirical Inference Conference Paper Disentangling Factors of Variations Using Few Labels Locatello, F., Tschannen, M., Bauer, S., Rätsch, G., Schölkopf, B., Bachem, O. 8th International Conference on Learning Representations (ICLR), April 2020 (Published) arXiv URL BibTeX

Empirical Inference Conference Paper Towards causal generative scene models via competition of experts von Kügelgen*, J., Ustyuzhaninov*, I., Gehler, P., Bethge, M., Schölkopf, B. ICLR 2020 Workshop "Causal Learning for Decision Making", April 2020, *equal contribution (Published) arXiv PDF BibTeX

Empirical Inference Conference Paper The Incomplete Rosetta Stone problem: Identifiability results for Multi-view Nonlinear ICA Gresele*, L., Rubenstein*, P. K., Mehrjou, A., Locatello, F., Schölkopf, B. Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence (UAI), 115:217-227, Proceedings of Machine Learning Research, (Editors: Adams, Ryan P. and Gogate, Vibhav), PMLR, July 2019, *equal contribution (Published) URL BibTeX

Empirical Inference Conference Paper Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations Locatello, F., Bauer, S., Lucic, M., Raetsch, G., Gelly, S., Schölkopf, B., Bachem, O. Proceedings of the 36th International Conference on Machine Learning (ICML), 97:4114-4124, Proceedings of Machine Learning Research, (Editors: Chaudhuri, Kamalika and Salakhutdinov, Ruslan), PMLR, June 2019 (Published) PDF URL BibTeX

Empirical Inference Conference Paper Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models Neitz, A., Parascandolo, G., Bauer, S., Schölkopf, B. Advances in Neural Information Processing Systems 31 (NeurIPS 2018), :9838-9848, (Editors: S. Bengio and H. Wallach and H. Larochelle and K. Grauman and N. Cesa-Bianchi and R. Garnett), Curran Associates, Inc., 32nd Annual Conference on Neural Information Processing Systems, December 2018 (Published) arXiv URL BibTeX

Empirical Inference Conference Paper From Deterministic ODEs to Dynamic Structural Causal Models Rubenstein, P. K., Bongers, S., Schölkopf, B., Mooij, J. M. Proceedings of the 34th Conference on Uncertainty in Artificial Intelligence (UAI), :114-123, (Editors: Globerson, Amir and Silva, Ricardo), August 2018 (Published) Arxiv URL BibTeX

Empirical Inference Conference Paper Learning Independent Causal Mechanisms Parascandolo, G., Kilbertus, N., Rojas-Carulla, M., Schölkopf, B. Proceedings of the 35th International Conference on Machine Learning (ICML), 80:4033-4041, Proceedings of Machine Learning Research, (Editors: Dy, Jennifer and Krause, Andreas), PMLR, July 2018 (Published) URL BibTeX

Empirical Inference Conference Paper Group invariance principles for causal generative models Besserve, M., Shajarisales, N., Schölkopf, B., Janzing, D. Proceedings of the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), 84:557-565, Proceedings of Machine Learning Research, (Editors: Amos Storkey and Fernando Perez-Cruz), PMLR, April 2018 (Published) URL BibTeX

Empirical Inference Conference Paper Causal Consistency of Structural Equation Models Rubenstein*, P. K., Weichwald*, S., Bongers, S., Mooij, J. M., Janzing, D., Grosse-Wentrup, M., Schölkopf, B. Proceedings of the 33rd Conference on Uncertainty in Artificial Intelligence (UAI), :ID 11, (Editors: Gal Elidan, Kristian Kersting, and Alexander T. Ihler), August 2017, *equal contribution (Published) Arxiv PDF URL BibTeX

Empirical Inference Conference Paper Causal Discovery from Temporally Aggregated Time Series Gong, M., Zhang, K., Schölkopf, B., Glymour, C., Tao, D. Proceedings of the 33rd Conference on Uncertainty in Artificial Intelligence (UAI), :ID 269, (Editors: Gal Elidan, Kristian Kersting, and Alexander T. Ihler), August 2017 (Published) URL BibTeX