MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Last update: Jan 07, 2023

Related tags

Overview

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

This repo is the official implementation of "MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation, Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc Van Gool" in PyTorch.

Dependencies

Cuda 11.1
Python 3.6
Pytorch 1.7.1

Dataset setup

Please download the dataset from Human3.6m website and refer to VideoPose3D to set up the Human3.6M dataset ('./dataset' directory).

${POSE_ROOT}/
|-- dataset
|   |-- data_3d_h36m.npz
|   |-- data_2d_h36m_cpn_ft_h36m_dbb.npz

Download pretrained model

The pretrained model can be found in Google_Drive, please download it and put in the './checkpoint' dictory.

Test the model

To test on pretrained model on Human3.6M:

python main.py --reload --previous_dir 'checkpoint/pretrained'

Here, we compare our MHFormer with recent state-of-the-art methods on Human3.6M dataset. Evaluation metric is Mean Per Joint Position Error (MPJPE) in mm.

Models	MPJPE
VideoPose3D	46.8
PoseFormer	44.3
MHFormer	43.0

Train the model

To train on Human3.6M:

python main.py --train

Citation

If you find our work useful in your research, please consider citing:

@article{li2021mhformer,
  title={MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation},
  author={Li, Wenhao and Liu, Hong and Tang, Hao and Wang, Pichao and Van Gool, Luc},
  journal={arXiv preprint},
  year={2021}
}

Acknowledgement

Our code is extended from the following repositories. We thank the authors for releasing the codes.

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Related tags

Overview

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Dependencies

Dataset setup

Download pretrained model

Test the model

Train the model

Citation

Acknowledgement

Owner

Vegetabird

CVPR 2021 Challenge on Super-Resolution Space

Step by Step on how to create an vision recognition model using LOBE.ai, export the model and run the model in an Azure Function

A large-image collection explorer and fast classification tool

Simulation environments for the CrazyFlie quadrotor: Used for Reinforcement Learning and Sim-to-Real Transfer

Pose estimation with MoveNet Lightning

This example implements the end-to-end MLOps process using Vertex AI platform and Smart Analytics technology capabilities

Yolo algorithm for detection + centroid tracker to track vehicles

Hippocampal segmentation using the UNet network for each axis

An open-source project for applying deep learning to medical scenarios

Reverse engineering Rosetta 2 in M1 Mac

Circuit Training: An open-source framework for generating chip floor plans with distributed deep reinforcement learning

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Sign Language Translation with Transformers (COLING'2020, ECCV'20 SLRTP Workshop)

Control-Robot-Arm-using-PS4-Controller - A Robotic Arm based on Raspberry Pi and Arduino that controlled by PS4 Controller

YoloV5 implemented by TensorFlow2 , with support for training, evaluation and inference.

Online Multi-Granularity Distillation for GAN Compression (ICCV2021)

Revisiting Temporal Alignment for Video Restoration

CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.

Repo for CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"

Research shows Google collects 20x more data from Android than Apple collects from iOS. Block this non-consensual telemetry using pihole blocklists.