MULTI HYBRID EXTRACTOR NETWORK FOR 3D HUMAN POSE ESTIMATION

Zhixiang Yuan, Xitie Zhang, Suping Wu, Boyang Zhang, Yuxin Peng, Bing Wang

Poster 11 Oct 2023

3D human pose estimation from monocular images or video remains challenging because of depth ambiguity and occluded joints. To address these difficulties, we propose a Multiple Hybrid Extraction Network (MHENet), which uses multiple hybrid extractors with different structures to obtain three different feature representations of pose hypotheses, then applies interaction and fusion across them to produce an accurate 3D pose. The Hybrid Extraction Module yields three kinds of hypothesis features: base features, which correspond to structural information; diverse features, which correspond to detail information; and condensed features, which correspond to action information. The Hypotheses Interaction Fusion Module builds relationships across the hypothesis features to generate more accurate 3D poses. Extensive qualitative and quantitative experiments on a large-scale publicly available dataset demonstrate that our approach achieves competitive performance compared with state-of-the-art methods. The code will be made publicly available.
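The data flow described in the abstract (three extractors producing hypothesis features, followed by interaction and fusion) can be illustrated with a toy NumPy sketch. This is not the authors' implementation: the class name `ToyMultiHypothesisNet`, the layer sizes, the 17-joint skeleton, the use of small random MLPs in place of the hybrid CNN/transformer extractors, and the attention-style mixing across hypotheses are all assumptions made only to show the shape of the computation on a single frame.

```python
import numpy as np

rng = np.random.default_rng(0)


def linear(in_dim, out_dim):
    """Random weight matrix standing in for a learned layer (assumption)."""
    return rng.standard_normal((in_dim, out_dim)) / np.sqrt(in_dim)


def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


class ToyMultiHypothesisNet:
    """Hypothetical stand-in: three differently shaped branches plus a fusion step."""

    def __init__(self, num_joints=17, dim=32):
        self.num_joints = num_joints
        self.dim = dim
        # Three extractors with different structures (here: different depths/widths).
        self.base = [linear(2, dim)]                                # "base" branch
        self.diverse = [linear(2, 2 * dim), linear(2 * dim, dim)]   # "diverse" branch
        self.condensed = [linear(2, dim // 2), linear(dim // 2, dim)]  # "condensed" branch
        # Regression head mapping the fused features to 3D coordinates.
        self.head = linear(3 * dim, 3)

    @staticmethod
    def _run(layers, x):
        # ReLU between layers, no activation after the last one.
        for w in layers[:-1]:
            x = np.maximum(x @ w, 0.0)
        return x @ layers[-1]

    def forward(self, pose_2d):
        # pose_2d: (J, 2) array of 2D joint coordinates for one frame.
        branches = (self.base, self.diverse, self.condensed)
        hyps = np.stack([self._run(b, pose_2d) for b in branches])  # (3, J, D)

        # Interaction: for each joint, attend across the three hypotheses.
        h = hyps.transpose(1, 0, 2)                                  # (J, 3, D)
        scores = h @ h.transpose(0, 2, 1) / np.sqrt(self.dim)        # (J, 3, 3)
        mixed = softmax(scores, axis=-1) @ h                         # (J, 3, D)

        # Fusion: concatenate the mixed hypotheses and regress a 3D pose.
        fused = mixed.reshape(self.num_joints, -1)                   # (J, 3 * D)
        return fused @ self.head                                     # (J, 3)


net = ToyMultiHypothesisNet()
pose_3d = net.forward(rng.standard_normal((17, 2)))
print(pose_3d.shape)  # (17, 3)
```

With untrained random weights the output values are meaningless; the sketch only shows how three feature sets for the same input can be related to one another before a single 3D pose is regressed.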

Tags: 3d human pose estimation, transformer, cnn, encoder-decoder network
