Optical Flow and Human Action

Understanding human action requires modeling human movement. While we mostly focus on 3D human movement, what is directly observable in videos is 2D optical flow. Previous work has shown that flow is useful for action recognition; consequently, we explore how to estimate human flow more accurately and how better flow improves action recognition.
Specifically, we train a neural network to compute single-human [] and multi-human [] optical flow. To enable this, we create a new synthetic training database of image sequences with ground-truth human optical flow. For this we use the 3D SMPL body model, motion-capture data, and computer graphics to synthesize realistic flow fields; this effectively extends the SURREAL dataset []. We then train a convolutional neural network (SpyNet []) to estimate human optical flow from pairs of images.
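As a rough illustration, the core of this supervised training step can be sketched as follows in PyTorch. The tiny network below is only a toy stand-in for SpyNet (which is a coarse-to-fine pyramid network), and the random tensors stand in for one batch of rendered image pairs with ground-truth flow; only the end-point-error loss reflects the actual training objective.

import torch
import torch.nn as nn

class TinyFlowNet(nn.Module):
    """Toy stand-in for SpyNet: maps a stacked image pair to a 2D flow field."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(6, 32, kernel_size=7, padding=3), nn.ReLU(),
            nn.Conv2d(32, 32, kernel_size=7, padding=3), nn.ReLU(),
            nn.Conv2d(32, 2, kernel_size=7, padding=3),  # 2 channels: (u, v)
        )

    def forward(self, img1, img2):
        return self.net(torch.cat([img1, img2], dim=1))

model = TinyFlowNet()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

# One synthetic batch standing in for (frame_t, frame_t+1, ground-truth flow).
img1 = torch.rand(8, 3, 128, 128)
img2 = torch.rand(8, 3, 128, 128)
flow_gt = torch.randn(8, 2, 128, 128)

flow_pred = model(img1, img2)
# Average end-point error (EPE): mean Euclidean distance between predicted
# and ground-truth flow vectors over all pixels.
epe = torch.norm(flow_pred - flow_gt, dim=1).mean()
optimizer.zero_grad()
epe.backward()
optimizer.step()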
The new network is more accurate than a wide range of top methods on held-out test data and generalizes well to real image sequences. When combined with a person detector/tracker, the approach provides a full solution to the problem of 2D human flow estimation.
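A minimal sketch of this detect-then-estimate pipeline, reusing the TinyFlowNet stand-in from the sketch above: the bounding box is assumed to come from an off-the-shelf person detector/tracker (here it is simply hardcoded), and flow is estimated only inside a padded crop around the person.

import torch

def human_flow_in_scene(flow_net, frame1, frame2, box, pad=16):
    # box: (x0, y0, x1, y1) person bounding box from a detector/tracker.
    # Returns a full-resolution flow field that is zero outside the crop.
    _, _, H, W = frame1.shape
    x0, y0, x1, y1 = box
    x0, y0 = max(x0 - pad, 0), max(y0 - pad, 0)
    x1, y1 = min(x1 + pad, W), min(y1 + pad, H)
    crop_flow = flow_net(frame1[..., y0:y1, x0:x1],
                         frame2[..., y0:y1, x0:x1])
    full_flow = torch.zeros(frame1.shape[0], 2, H, W)
    full_flow[..., y0:y1, x0:x1] = crop_flow
    return full_flow

# Example call with a hardcoded box standing in for a detection.
flow_net = TinyFlowNet()
frame1, frame2 = torch.rand(1, 3, 240, 320), torch.rand(1, 3, 240, 320)
flow = human_flow_in_scene(flow_net, frame1, frame2, box=(100, 60, 200, 220))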
Most top-performing action-recognition methods use optical flow as a ``black box'' input. In [], we take a deeper look at the combination of flow and action recognition and find that: 1) optical flow is useful for action recognition because it is invariant to appearance; 2) flow accuracy at motion boundaries and for small displacements correlates most strongly with action-recognition performance; 3) to improve action recognition, optical flow should be trained to minimize classification error rather than the popular end-point error (EPE); and 4) optical flow learned for action recognition differs from traditional optical flow mostly inside human bodies and at their boundaries.
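Finding 3) in particular suggests a differentiable pipeline in which the classification loss, not EPE, drives the flow network. A hedged sketch of that idea, again reusing the TinyFlowNet stand-in (the classifier is an equally simplified placeholder, and the random tensors stand in for real video frames and action labels):

import torch
import torch.nn as nn

flow_net = TinyFlowNet()
classifier = nn.Sequential(               # toy action classifier on flow
    nn.Conv2d(2, 16, kernel_size=3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(16, 10),                    # e.g. 10 action classes
)
params = list(flow_net.parameters()) + list(classifier.parameters())
optimizer = torch.optim.Adam(params, lr=1e-5)

img1, img2 = torch.rand(4, 3, 128, 128), torch.rand(4, 3, 128, 128)
labels = torch.randint(0, 10, (4,))

flow = flow_net(img1, img2)               # flow stays differentiable
logits = classifier(flow)
# Cross-entropy replaces EPE, so gradients from the recognition task
# reach the flow network and reshape the flow where it matters.
loss = nn.functional.cross_entropy(logits, labels)
optimizer.zero_grad()
loss.backward()
optimizer.step()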