Header logo is

Localization and recognition of human action in 3D using transformers

2024

Article

ps


Understanding a person’s behavior from their 3D motion sequence is a fundamental problem in computer vision with many applications. An important component of this problem is 3D action localization, which involves recognizing what actions a person is performing, and when the actions occur in the sequence. To promote the progress of the 3D action localization community, we introduce a new, challenging, and more complex benchmark dataset, BABEL-TAL (BT), for 3D action localization. Important baselines and evaluating metrics, as well as human evaluations, are carefully established on this benchmark. We also propose a strong baseline model, i.e., Localizing Actions with Transformers (LocATe), that jointly localizes and recognizes actions in a 3D sequence. The proposed LocATe shows superior performance on BABEL-TAL as well as on the large-scale PKU-MMD dataset, achieving state-of-the-art performance by using only 10% of the labeled training data. Our research could advance the development of more accurate and efficient systems for human behavior analysis, with potential applications in areas such as human-computer interaction and healthcare.

Author(s): Jiankai Sun and Linjiang Huang and Hongsong Wang, Chuanyang Zheng, Jianing Qiu and Md Tauhidul Islam and Enze Xie and Bolei Zhou and Lei Xing and Arjun Chandrasekaran and Michael J. Black
Journal: Nature Communications Engineering
Volume: 13
Number (issue): 125
Year: 2024
Month: September

Department(s): Perceiving Systems
Bibtex Type: Article (article)
Paper Type: Journal

DOI: https://www.nature.com/articles/s44172-024-00272-7

Links: paper

BibTex

@article{locate:2024,
  title = {Localization and recognition of human action in {3D} using transformers},
  author = {Sun, Jiankai and Huang, Linjiang and Hongsong Wang, Chuanyang Zheng, Jianing Qiu and Islam, Md Tauhidul and Xie, Enze and Zhou, Bolei and Xing, Lei and Chandrasekaran, Arjun and Black, Michael J.},
  journal = {Nature Communications Engineering },
  volume = {13},
  number = {125},
  month = sep,
  year = {2024},
  doi = {https://www.nature.com/articles/s44172-024-00272-7},
  month_numeric = {9}
}