Events & Talks

Haptic Intelligence IS Colloquium 09-11-2021 Simulating the GelSight Sensors: From Physics to Data In recent years, vision-based high-resolution tactile sensors such as GelSight have been widely used because the rich signal provides useful information regarding the state of the robot and the environment. However, a major barrier for applying tactile sensors like GelSight is the cost. To make the tactile sensors more accessible to a broader community, we propose to build a simulation model for vision-based tactile sensors like GelSight. We explore two modeling methods for making the model: a physically-based method that uses rendering technologies to simulate the sensor's optical design a... Katherine J. Kuchenbecker
Perceiving Systems Talk 05-10-2021 Toward Reconstructing Face from Voice We address a new challenge posed by voice profiling - reconstructing someone’s face from their voice. Specifically, given an audio clip spoken by an unseen person, we aim to reconstruct a face that has as many associations as possible with the speaker in terms of identity. In this talk, I will introduce how we explore and approach the ultimate goal step by step. First, we investigate the audio-visual association by matching voices to faces based on identity, and vice versa. Second, we set up a baseline for reconstructing 2D face images from a voice recording and show reasonable reconstructi... Timo Bolkart
Event 30-09-2021 Anniversary: 10 years MPI-IS & 100 years MPI-MF The Max Planck Institute for Intelligent Systems in Stuttgart and Tübingen is celebrating a double anniversary this year: As one of the oldest and largest institutes of the Max Planck Society, the Max Planck Institute for Metals Research celebrates its 100th anniversary. Simultaneously, we are also celebrating the 10th anniversary of the Max Planck Institute for Intelligent Systems, which emerged from the scientific realignment of the institute in 2011. Please join us in celebrating this historic event for our institute! Bernhard Schölkopf Matthias Tröndle Barbara Kettemann Oliwia Gust Linda Behringer Claudia Daefler Alejandro Posada Nassim Taghipour Katherine J. Kuchenbecker Gisela Schütz
Perceiving Systems Talk 28-09-2021 DeepMultiCap & Lightweight Multi-person Total Motion Capture Using Sparse Multi-view Cameras We propose DeepMultiCap, a novel method for multi-person performance capture using sparse multi-view cameras. Our method can capture time varying surface details without the need of using pre-scanned template models. To tackle the serious occlusion challenge for close interacting scenes, we combine a recently proposed pixel-aligned implicit function with a parametric model for robust reconstruction of the invisible surface areas. An effective attention-aware module is designed to obtain the fine-grained geometry details from multi-view images, where high-fidelity results can be generated. I... Chun-Hao Paul Huang
Perceiving Systems Talk 27-09-2021 Refraction and Absorption for Underwater Shape Recovery In this talk the speaker will present her work on the recovery of rigid and deformable 3D shape from underwater images. Silvia Zuffi
Perceiving Systems Talk 22-09-2021 Learning motion priors for 4D human body capture in 3D scenes It is challenging to recover realistic human-scene interactions and high-quality human motions while dealing with occlusions and partial views with a monocular RGB(D) camera. We address this problem by learning motion smoothness and infilling priors from the large scale mocap dataset AMASS, to reduce the jitters, and handle contacts and occlusions, respectively. Furthermore, we combine them into a multi-stage optimization pipeline for the high quality 4D human capture in complex 3D scenes. Chun-Hao Paul Huang
Perceiving Systems Talk 06-09-2021 From skeleton to body: Keypoint Estimation is Helpful for Human Body Reconstruction My works mainly lie in inferring human structures from RGB inputs, which starts from 2D keypoint estimation, towards more complex tasks like 3D skeleton inference and SMPL-based human pose & shape estimation. Along this road, we find that high-level tasks, like human body estimation, can benefit a lot from low-level inferred structures, like 3D skeletons, and vice versa. Furthermore, in our latest work, "Human Pose Regression with Residual Log-likelihood Estimation", we unified all the above HPS tasks in a direct regression paradigm, replacing generally accepted heatmap without loss of accu... Yuliang Xiu
Haptic Intelligence PhD Thesis Defense 12-08-2021 HuggieBot: An Interactive Hugging Robot with Visual and Haptic Perception Hugs are one of the first forms of contact and affection humans experience. Receiving a hug is one of the best ways to feel socially supported, and the lack of social touch can have severe adverse effects on an individual's well-being. Due to the prevalence and health benefits of hugging, roboticists are interested in creating robots that can hug humans as seamlessly as humans hug other humans. However, hugs are complex affective interactions that need to adapt to the height, body shape, and preferences of the hugging partner, and they often include intra-hug gestures like squeezes. This di... Alexis Block Katherine J. Kuchenbecker
Perceiving Systems Talk 27-07-2021 Modeling 3D Human Motion for Improved Pose Estimation Though substantial progress has been made in estimating 3D human poses from dynamic observations, recent methods still struggle to recover physically-plausible motions, and the presence of noise and occlusions remains challenging. In this talk, I'll introduce two methods that tackle these issues by leveraging models of 3D human motion - one physics-based and one learned. In the first approach, an initial 3D motion is refined using a physics-based trajectory optimization that leverages automatically-detected foot contacts from RGB video. In the second, a learned generative model is used as a... Muhammed Kocabas
Perceiving Systems Talk 26-07-2021 AI SYNTHESIS: FROM AVATARS TO 3D SCENES In this talk I will motivate how digital humans will impact the future of communication, human-machine interaction, and content creation. I will present our latest 3D avatar digitization technology from Pinscreen from a single photo, and give a live demonstration. I will also showcase how we use hybrid CG and neural rendering solutions for real-time applications used in next generation virtual assistant and virtual production pipelines. I will then present a real-time teleportation system that only uses a single webcam as input, and our latest efforts at UC Berkeley in real-time AI synthesi... Yao Feng
Event 22-07-2021 2021 Ph.D. Graduation Ceremony With this Graduation Ceremony, we celebrate the success of 21 young researchers who successfully defended their doctoral theses during the past 12 months. Six world-class universities are awarding doctoral degrees to this year’s batch of graduating students, who are affiliated either directly with MPI-IS or indirectly through our doctoral programs (Cam-Tue, CLS, CMU, and IMPRS-IS). Cäcilia Heinisch Sarah Danes Sara Sorce Leila Masri
Perceiving Systems Talk 14-07-2021 TRANSPR: Transparency Ray-Accumulating Neural 3D Scene Point Renderer We propose and evaluate a neural point-based graphics method that can model semi-transparent scene parts. Similarly to its predecessor pipeline, ours uses point clouds to model proxy geometry, and augments each point with a neural descriptor. Additionally, a learnable transparency value is introduced in our approach for each point. Our neural rendering procedure consists of two steps. Firstly, the point cloud is rasterized using ray grouping into a multi-channel image. This is followed by the neural rendering step that "translates" the rasterized image into an RGB output using a learnable ... Qianli Ma
IS Colloquium 28-06-2021 Teaching Robots to See – Challenges and Developments in Robotic Vision As vision plays a key role in how we interpret a situation, developing vision-based perception for robots promises to be a big step towards robotic intelligence. This talk will briefly discuss some of the biggest challenges we are faced with all the way from robust localization and mapping, to dense scene representation for path planning, and collaborative perception. With effective robot collaboration featuring as a key scientific challenge in the field, the talk will focus on this topic describing our recent progress in this area at the Vision for Robotics Lab of ETH Zurich (http://www.v4... Katherine J. Kuchenbecker Oliwia Gust
Perceiving Systems Talk 14-06-2021 Using Generative Models for Faces to Test Neural Networks Most machine learning models are validated on fixed datasets. This can give an incomplete picture of the capabilities and weaknesses of the model. Such weaknesses can be revealed at test time in the real world with dire consequences. In order to alleviate this issue, simulators can be controlled in a fine-grained manner using interpretable parameters to explore the semantic image manifold and discover such weaknesses before deploying a model. Also, in recent years there have been important advances in generative models for computer vision resulting in realistic face generation and manipulat... Timo Bolkart
Perceiving Systems Talk 10-06-2021 Learning Skeletal Articulations with Neural Blend Shapes Animating a newly designed character using motion capture (mocap) data is a long standing problem in computer animation. A key consideration is the skeletal structure that should correspond to the available mocap data, and the shape deformation in the joint regions, which often requires a tailored, pose-specific refinement. In this work, we develop a neural technique for articulating 3D characters using enveloping with a pre-defined skeletal structure which produces high quality pose dependent deformations. Our framework learns to rig and skin characters with the same articulation structure... Hongwei Yi
Rationality Enhancement Conference 09-06-2021 - 13-06-2021 Life Improvement Science Conference Life Improvement Science (LIS) is an emerging transdisciplinary research field that investigates how we can help people do more good in better ways (well-doing). Falk Lieder Mike Prentice Pin-Zhen Chen Anastasia Lado Sierra Kaiser Victoria Amo
Perceiving Systems Talk 01-06-2021 Real-time Deep Dynamic Characters Animatable and photo-realistic virtual 3D characters are of enormous importance nowadays. However, generating realistic characters still requires manual intervention, expensive equipment, and the resulting characters are either difficult to control or not realistic. Therefore, the goal of the work, that is presented within the talk, is to learn digital characters which are both realistic and easy to control and can be learned directly from a multi-view video. To this end, I will introduce a deep videorealistic 3D human character model displaying highly realistic shape, motion, and dynamic a... Yinghao Huang Chun-Hao Paul Huang
Perceiving Systems Talk 12-04-2021 Expressive Whole-Body 3D Multi-Person Pose and Shape Estimation from a Single Image Humans are the most central and interesting subjects in our lives: many human-centric techniques and studies have been proposed by both industry and academia, such as virtual try-on, 3D personal avatars, and marker-less motion capture in the movie/game industry, including AR/VR. Recovery of accurate 3D geometry of humans (i.e., 3D human pose and shape) is a key component of these human-centric techniques and studies. In particular, the 3D pose and shape of multiple persons can convey the relative 3D locations of the persons. Also, the 3D pose and shape of the whole body, which includes hands and fac... Chun-Hao Paul Huang
Perceiving Systems Talk 08-04-2021 Pushing the Boundaries of Novel View Synthesis 2020 was a turbulent year, but for 3D learning it was a fruitful one with lots of exciting new tools and ideas. In particular, there have been many exciting developments in the area of coordinate based neural networks and novel view synthesis. In this talk I will discuss our recent work on single image view synthesis with pixelNeRF, which aims to predict a Neural Radiance Field (NeRF) from a single image. I will discuss how NeRF representation allows models like pixel-aligned implicit functions (PiFu) to be trained without explicit 3D supervision and the importance of other key design fact... Qianli Ma
Perceiving Systems Talk 01-04-2021 Hair & garment synthesis using deep learning method For both AR and VR applications, there is a strong motivation to generate virtual avatars with realistic hair and garments, the two most significant elements for personifying any character. However, due to the complex structures and ever-changing fashion styles, modeling hair and garments still remains tedious and expensive, as it requires considerable professional effort. My research interest focuses on deep learning methods in 3D modeling, rendering, and animation, especially to synthesize high-quality hair and garments with plausible details. In this talk, I will present the progres... Jinlong Yang
Perceiving Systems Talk 22-02-2021 Joint Learning Over Visual and Geometric Data Many challenges remain in applying machine learning to domains where obtaining massive annotated data is difficult. We discuss approaches that aim to reduce supervision load for learning algorithms in the visual and geometric domains by leveraging correlations among data as well as among learning tasks -- what we call joint learning. The basic notion is that inference problems do not occur in isolation but rather in a "social context" that can be exploited to provide self-supervision by enforcing consistency, thus improving performance and increasing sample efficiency. An example is voting ... Qianli Ma
Perceiving Systems Talk 10-02-2021 AI Choreographer: Learn to dance with AIST++ In this work, we present a transformer-based learning framework for 3D dance generation conditioned on music. We carefully design our network architecture and empirically study the keys for obtaining qualitatively pleasing results. In addition, we propose a new dataset of paired 3D motion and music called AIST++, which contains 1.1M frames of 3D dance motion in 1408 sequences, covering 10 genres of dance choreographies and accompanied with multi-view camera parameters. To our knowledge it is the largest dataset of this kind. Yuliang Xiu
Physical Intelligence Talk 09-02-2021 Phosphonate-MOFs,-HOFs: Next generation of microporous compounds Among the other metal organic framework (MOF) families, phosphonic acids provide the richest metal binding for MOF synthesis, and they are known to exhibit exceptional thermal and chemical stabilities. Due to the synthetic difficulties, the total number of microporous phosphonate-MOFs is still limited in the literature. In this work, we provide design and synthesis strategies to form semiconductive, proton conductive microporous MOFs and hydrogen bonded organic frameworks (HOFs) constructed using structure directing arylphosphonic acids. Metin Sitti
Perceiving Systems Talk 28-01-2021 Creating, Weaponizing, and Detecting Deep Fakes The past few years have seen a startling and troubling rise in the fake-news phenomenon in which everyone from individuals to nation-sponsored entities can produce and distribute misinformation. The implications of fake news range from a misinformed public to an existential threat to democracy, and horrific violence. At the same time, recent and rapid advances in machine learning are making it easier than ever to create sophisticated and compelling fake images, videos, and audio recordings, making the fake-news phenomenon even more powerful and dangerous. These AI-synthesized media (so-called... Jinlong Yang
Symposium 27-01-2021 - 28-01-2021 IMPRS-IS 2021 Symposium Keynotes The 2021 IMPRS-IS Interview Symposium will feature two keynote presentations open to our entire community. Speakers include Dr. Ulrike von Luxburg of the University of Tübingen and Dr. Christoph Keplinger representing the Max Planck Institute for Intelligent Systems. Leila Masri Sara Sorce
Perceiving Systems Talk 27-01-2021 Towards a more holistic understanding of scene, object, and human Humans, even young infants, are adept at perceiving and understanding complex indoor scenes. Such an incredible vision system relies not only on data-driven pattern recognition but also on the visual reasoning system, known as the core knowledge, that facilitates 3D holistic scene understanding tasks. This talk discusses how to employ physical common sense and human-object interaction to bridge scene and human understanding and how part-level 3D affordance perception may lead to more fine-grained human-object interaction modeling. Future directions may be extended to d... Dimitris Tzionas
Perceiving Systems Talk 21-01-2021 Non-Rigid Shape Correspondence through Deformation Solving for 3D correspondences beyond isometries has made tremendous progress in recent years, much of it due to (deep) learning. However, not all applications provide the necessary training data. This talk will focus on how far we can take the results without learning. I will present a line of work that poses the non-rigid shape registration problem in terms of physical and non-physical deformation energies. Our work aims to combine extrinsic and intrinsic measures to overcome typical shortcomings of both. We use Functional Maps and Markov Chain Monte Carlo initialization to handle all kin... Jinlong Yang
Perceiving Systems Talk 14-12-2020 A Future with Self-Driving Vehicles We are on the verge of a new era in which robotics and artificial intelligence will play an important role in our daily lives. Self-driving vehicles have the potential to redefine transportation as we understand it today. Our roads will become safer and less congested, while parking spots will be repurposed as leisure zones and parks. However, many technological challenges remain as we pursue this future. In this talk I will showcase the latest advancements made by Uber Advanced Technologies Group in the quest towards self-driving vehicles at scale. Qianli Ma
Talk 18-11-2020 Electronic Tattoos for Mobile Sensing and Therapeutics Merging the human body with electronics and machines can enable the internet of health (IoH), human-machine interfaces (HMI), as well as augmented human capabilities. However, bio-tissues are soft, curvilinear and dynamic whereas wafer-based electronics are hard, planar, and rigid. Over the past decade, stretchable high-performance inorganic electronics have blossomed as a result of innovative structural designs and fabrication processes. In particular, epidermal electronics, a.k.a. electronic tattoos (e-tattoos) represent a class of noninvasive stretchable circuits, sensors, and stimulators that are ultra... Metin Sitti
Haptic Intelligence PhD Thesis Defense 06-10-2020 Delivering Expressive and Personalized Fingertip Tactile Cues Wearable haptic devices have seen growing interest in recent years, but providing realistic tactile feedback is not a challenge that is soon to be solved. Daily interactions with physical objects elicit complex sensations at the fingertips. Furthermore, human fingertips exhibit a broad range of physical dimensions and perceptive abilities, adding increased complexity to the task of simulating haptic interactions in a compelling manner. However, as the applications of wearable haptic feedback grow, concerns of wearability and generalizability often persuade tactile device designers to simpli... Katherine J. Kuchenbecker Eric Young
Perceiving Systems Talk 05-10-2020 The phenotyping revolution One of the most striking characteristics of human behavior in contrast to all other animals is that we show extraordinary variability across populations. Human cultural diversity is a biological oddity. More specifically, we propose that what makes humans unique is the nature of the individual ontogenetic process, that results in this unparalleled cultural diversity. Hence, our central question is: How is human ontogeny adapted to cultural diversity and how does it contribute to it? This question is critical, because cultural diversity does not only entail our predominant mode of adaptation ... Timo Bolkart
Perceiving Systems Talk 02-10-2020 Reconstructing the Plenoptic Function Imagine a futuristic version of Google Street View that could dial up any possible place in the world, at any possible time. Effectively, such a service would be a recording of the plenoptic function—the hypothetical function described by Adelson and Bergen that captures all light rays passing through space at all times. While the plenoptic function is completely impractical to capture in its totality, every photo ever taken represents a sample of this function. I will present recent methods we've developed to reconstruct the plenoptic function from sparse space-time samples of photos—inclu...
Autonomous Vision Event 28-09-2020 - 01-10-2020 German Conference on Pattern Recognition DAGM-GCPR 2020 in Tübingen The 42nd German Conference on Pattern Recognition (DAGM-GCPR 2020), the 25th International Symposium on Vision, Modeling and Visualization (VMV 2020) and the 10th Eurographics Workshop on Visual Computing for Biology and Medicine (VCBM 2020) will for the first time be co-located in Tübingen this year! Andreas Geiger
Event 31-08-2020 Scientific Symposium 2020 All current and former employees and friends of the Max Planck Institute for Intelligent Systems and Cyber Valley are welcome to attend this event. If you have any questions, please contact our Event Manager - Oliwia Gust (oliwia.gust@cyber-valley.de) Michael Black Katherine J. Kuchenbecker Bernhard Schölkopf Metin Sitti Florian Mayer Matthias Tröndle
Perceiving Systems Talk 10-08-2020 Functions, Machine Learning, and Game Development Game Development requires a vast array of tools, techniques, and expertise, ranging from game design, artistic content creation, to data management and low level engine programming. Yet all of these domains have one kind of task in common - the transformation of one kind of data into another. Meanwhile, advances in Machine Learning have resulted in a fundamental change in how we think about these kinds of data transformations - allowing for accurate and scalable function approximation, and the ability to train such approximations on virtually unlimited amounts of data. In this talk I will p... Abhinanda Ranjit Punnakkal
Perceiving Systems Talk 07-08-2020 Our Recent Research on 3D Deep Learning I will present three recent projects within the 3D Deep Learning research line from my team at Google Research: (1) a deep network for reconstructing the 3D shape of multiple objects appearing in a single RGB image (ECCV'20). (2) a new conditioning scheme for normalizing flow models. It enables several applications such as reconstructing an object's 3D point cloud from an image, or the converse problem of rendering an image given a 3D point cloud, both within the same modeling framework (CVPR'20); (3) a neural rendering framework that maps a voxelized object into a high quality image. It re... Yinghao Huang Arjun Chandrasekaran
Perceiving Systems Talk 28-07-2020 Learning from vision, touch and audition Babies learn with very little supervision, and, even when supervision is present, it comes in the form of an unknown spoken language that also needs to be learned. How can kids make sense of the world? In this work, I will show that an agent that has access to multimodal data (like vision, audition or touch) can use the correlation between images and sounds to discover objects in the world without supervision. I will show that ambient sounds can be used as a supervisory signal for learning to see and vice versa (the sound of crashing waves, the roar of fast-moving cars – sound conveys impor... Arjun Chandrasekaran
Haptic Intelligence Talk 28-07-2020 Tactile Sensing, Information, and Feedback via Wave Propagation A longstanding goal of engineering has been to realize haptic interfaces that can convey realistic sensations of touch, comparable to signals presented via visual or audio displays. Today, this ideal remains far from realization, due to the difficulty of characterizing and electronically reproducing the complex and dynamic tactile signals that are produced during even the simplest touch interactions. In this talk, I will present my work on capturing whole-hand tactile signals, in the form of mechanical waves, produced during natural hand interactions. I will describe how I characterized the... Katherine J. Kuchenbecker
Event 24-07-2020 2020 Intelligent Systems Summer Colloquium (Virtual Event) MPI-IS cordially invites you to attend the 2020 Intelligent Systems Summer Colloquium
Physical Intelligence Talk 22-07-2020 Large-Area Fabrication of Nanoscale Features by UV-NIL @ JR MATERIALS Roll-to-roll UV nanoimprint lithography (R2R-UV-NIL) is gaining increasing industrial interest for large-area nano- and micro-structuring of flexible substrates because it combines nanometer resolution with a productivity of many square meters per minute. Small-area masters of functional nano and micro surface structures are readily available by various lithographic techniques such as UV-, e-beam- or interference lithography. However, the upscaling of small-area nano- and micro-structured masters into medium size roller molds – often called shims - for R2R-UV-NIL production still remains a bottlene...
Perceiving Systems Talk 16-07-2020 Towards Commodity 3D Scanning for Content Creation In recent years, commodity 3D sensors have become widely available, spawning significant interest in both offline and real-time 3D reconstruction. While state-of-the-art reconstruction results from commodity RGB-D sensors are visually appealing, they are far from usable in practical computer graphics applications since they do not match the high quality of artist-modeled 3D graphics content. One of the biggest challenges in this context is that obtained 3D scans suffer from occlusions, thus resulting in incomplete 3D models. In this talk, I will present a data-driven approach towards genera... Yinghao Huang
Perceiving Systems Talk 13-07-2020 Learning from videos played forwards, backwards, fast, and slow How can we tell that a video is playing backwards? People's motions look wrong when the video is played backwards--can we develop an algorithm to distinguish forward from backward video? Similarly, can we tell if a video is sped-up? We have developed algorithms to distinguish forwards from backwards video, and fast from slow. Training algorithms for these tasks provides a self-supervised task that facilitates human activity recognition. We'll show these results, and applications of these unsupervised video learning tasks, including a method to change the timing of people in videos. Yinghao Huang
Perceiving Systems Talk 02-07-2020 Real-time Multi-person 3D Motion Capture with a Single RGB Camera In our recent work, XNect, we propose a real-time solution for the challenging task of multi-person 3D human pose estimation from a single RGB camera. To achieve real-time performance without compromising on accuracy, our approach relies on a new efficient Convolutional Neural Network architecture, and a multi-staged pose formulation. The CNN architecture is approx. 1.3x faster than ResNet-50, while achieving the same accuracy on various tasks, and the benefits extend beyond inference speed to a much smaller training memory footprint and a much higher training throughput. The proposed pose ... Yinghao Huang
Max Planck Lecture 16-06-2020 The sound of fermions Fermions, particles with half-integer spin like the electron, proton and neutron, obey the Pauli principle: They cannot share one and the same quantum state. This “anti social” behavior is directly observed in experiments with ultracold gases of fermionic atoms: Pauli blocking in momentum space for a free Fermi gas, and in real space in gases confined to an optical lattice. When fermions interact, new, rather “social” behavior emerges, i.e. hydrodynamic flow, superfluidity and magnetism. The interplay of Pauli’s principle and strong interactions poses great difficulties to our understanding...
Perceiving Systems Talk 10-06-2020 Canonicalization for 3D Perception In this talk, I will introduce the notion of 'canonicalization' and how it can be used to solve 3D computer vision tasks. I will describe Normalized Object Coordinate Space (NOCS), a 3D canonical container that we have developed for 3D estimation, aggregation, and synthesis tasks. I will demonstrate how NOCS allows us to address previously difficult tasks like category-level 6DoF object pose estimation, and correspondence-free multiview 3D shape aggregation. Finally, I will discuss future directions including opportunities to extend NOCS for tasks like articulated and non-rigid shape and po... Timo Bolkart
Haptic Intelligence Talk 09-06-2020 Robotic Manipulation: a Focus on Object Handovers Humans perform object manipulation in order to execute a specific task. Seldom is such action started with no goal in mind. In contrast, traditional robotic grasping (first stage for object manipulation) seems to focus purely on getting hold of the object—neglecting the goal of the manipulation. In this light, most metrics used in robotic grasping do not account for the final task in their judgement of quality and success. Since the overall goal of a manipulation task shapes the actions of humans and their grasps, the task itself should shape the metric of success. To this end, I will pre... Katherine J. Kuchenbecker
Max Planck Lecture 09-06-2020 Towards spectro-microscopy at extreme limits This talk is devoted to modern methods for attosecond and femtosecond laser spectro-microscopy with a special focus on applications that require extreme spatial resolution. In the first part, I discuss how high-harmonic generation by high-energy, high-power light transients holds promise to deliver the required photon flux and photon energy for attosecond pump-probe spectroscopy at high spatiotemporal resolution in order to capture electron dynamics in matter. I demonstrate the first prototype high-energy field synthesizer based on Yb:YAG, thin-disk laser technology for generating high...
Talk 18-05-2020 AirCap – Aerial Outdoor Motion Capture In this talk I will present an overview and the latest results of the project Aerial Outdoor Motion Capture (AirCap), running at the Perceiving Systems department. AirCap's goal is to achieve markerless and unconstrained human motion capture (MoCap) in unknown and unstructured outdoor environments. To this end, we have developed a flying MoCap system using a team of autonomous aerial robots with on-board, monocular RGB cameras. Our system is endowed with a range of novel functionalities which were developed by our group over the last 3 years. These include, i) cooperative detection and track... Katherine J. Kuchenbecker