Back

Perceiving Systems Members Publications

Inferring Actions

Inferring actions teaser
Top: Temporal Action Localization (TAL). The bilinear pooling algorithm [File Icon] recognizes and localizes all actions in an RGB video (yellow background). The hierarchical clustering algorithm [File Icon] uses both RGB and 3D joint positions for TAL (blue box). Below: The Action Recognition model in BABEL [File Icon] predicts the action in a mocap sequence.

Members

Publications

Perceiving Systems Conference Paper BABEL: Bodies, Action and Behavior with English Labels Punnakkal, A. R., Chandrasekaran, A., Athanasiou, N., Quiros-Ramirez, M. A., Black, M. J. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021), :722-731, IEEE, Piscataway, NJ, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021) , June 2021 (Published) dataset poster pdf sup mat video code DOI BibTeX

Perceiving Systems Empirical Inference Conference Paper Local Temporal Bilinear Pooling for Fine-grained Action Parsing Zhang, Y., Tang, S., Muandet, K., Jarvers, C., Neumann, H. In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), :12005-12015, IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), June 2019 () Code video demo pdf URL BibTeX

Perceiving Systems Article Temporal Human Action Segmentation via Dynamic Clustering Zhang, Y., Sun, H., Tang, S., Neumann, H. arXiv preprint arXiv:1803.05790, 2018 () URL BibTeX