Conference Paper 2020

On the Transfer of Inductive Biasfrom Simulation to the Real World: a New Disentanglement Dataset

{Learning meaningful and compact representations with structurally disentangled semantic aspects is considered to be of key importance in representation learning. Since real-world data is notoriously costly to collect, many recent state-of-the-art disentanglement models have heavily relied on synthetic toy data-sets. In this paper, we propose a novel data-set which consists of over 1 million images of physical 3D objects with seven factors of variation, such as object color, shape, size and position. In order to be able to control all the factors of variation precisely, we built an experimental platform where the objects are being moved by a robotic arm. In addition, we provide two more datasets which consist of simulations of the experimental setup. These datasets provide for the first time the possibility to systematically investigate how well different disentanglement methods perform on real data in comparison to simulation, and how simulated data can be leveraged to build better representations of the real world. We provide a first experimental study of these questions and our results indicate that learned models transfer poorly, but that model and hyperparameter selection is an effective means of transferring information to the real world.}

Author(s): Gondal, MW and Wuthrich, M and Miladinovic, D and Locatello, F and Breidt, M and Volchkov, V and Akpo, J and Bachem, O and Schölkopf, B and Bauer, S
Book Title: Advances in Neural Information Processing Systems 32
Pages: 15661--15672
Year: 2020
Publisher: Curran
Bibtex Type: Conference Paper (inproceedings)
Address: Vancouver, Canada
Electronic Archiving: grant_archive

BibTex

@inproceedings{item_3162653,
  title = {{On the Transfer of Inductive Biasfrom Simulation to the Real World: a New Disentanglement Dataset}},
  booktitle = {{Advances in Neural Information Processing Systems 32}},
  abstract = {{Learning meaningful and compact representations with structurally disentangled semantic aspects is considered to be of key importance in representation learning. Since real-world data is notoriously costly to collect, many recent state-of-the-art disentanglement models have heavily relied on synthetic toy data-sets. In this paper, we propose a novel data-set which consists of over 1 million images of physical 3D objects with seven factors of variation, such as object color, shape, size and position. In order to be able to control all the factors of variation precisely, we built an experimental platform where the objects are being moved by a robotic arm. In addition, we provide two more datasets which consist of simulations of the experimental setup. These datasets provide for the first time the possibility to systematically investigate how well different disentanglement methods perform on real data in comparison to simulation, and how simulated data can be leveraged to build better representations of the real world. We provide a first experimental study of these questions and our results indicate that learned models transfer poorly, but that model and hyperparameter selection is an effective means of transferring information to the real world.}},
  pages = {15661--15672},
  publisher = {Curran},
  address = {Vancouver, Canada},
  year = {2020},
  slug = {item_3162653},
  author = {Gondal, MW and Wuthrich, M and Miladinovic, D and Locatello, F and Breidt, M and Volchkov, V and Akpo, J and Bachem, O and Sch\"olkopf, B and Bauer, S}
}