This is an official implementation for "Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation".

Last update: Jan 07, 2023

Related tags

Deep Learning StridedTransformer-Pose3D

Overview

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

This repo is the official implementation of Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation in Pytorch.

Dependencies

Cuda 11.1
Python 3.6
Pytorch 1.7.1

Dataset setup

Please download the dataset from Human3.6m website and refer to VideoPose3D to set up the Human3.6M dataset ('./dataset' directory).

${POSE_ROOT}/
|-- dataset
|   |-- data_3d_h36m.npz
|   |-- data_2d_h36m_gt.npz
|   |-- data_2d_h36m_cpn_ft_h36m_dbb.npz

Download pretrained model

The pretrained model can be found in Google_Drive, please download it and put in the './checkpoint' dictory.

Test the model

To test on pretrained model on Human3.6M with 351-frames:

python main.py --frames 351 --refine --reload 1  --refine_reload 1 --previous_dir 'checkpoint/351'

Train the model

To train on Human3.6M with 351-frame:

python main.py --frames 351 --train 1 \

After training for several epoches, add refine module

python main.py --frames 351 --train 1 --refine --lr 1e-5 --reload 1 --previous_dir [your model saved path] \

Citation

If you find our work useful in your research, please consider citing:

@article{li2021exploiting,
  title={Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation},
  author={Li, Wenhao and Liu, Hong and Ding, Runwei and Liu, Mengyuan and Wang, Pichao and Yang, Wenming},
  journal={arXiv preprint arXiv:2103.14304},
  year={2021}
}

Acknowledgement

Our code is built on top of ST-GCN and is extended from the following repositories. We thank the authors for releasing the codes.

This is an official implementation for "Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation".

Related tags

Overview

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

Dependencies

Dataset setup

Download pretrained model

Test the model

Train the model

Citation

Acknowledgement

Owner

Vegetabird

Gym environments used in the paper: "Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors"

Tensorflow implementation of "BEGAN: Boundary Equilibrium Generative Adversarial Networks"

This is an official implementation of the paper "Distance-aware Quantization", accepted to ICCV2021.

Crossover Learning for Fast Online Video Instance Segmentation (ICCV 2021)

PyTorch META-DATASET (Few-shot classification benchmark)

Controlling the MicriSpotAI robot from scratch

Code to reproduce the experiments in the paper "Transformer Based Multi-Source Domain Adaptation" (EMNLP 2020)

official implemntation for "Contrastive Learning with Stronger Augmentations"

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Gas detection for Raspberry Pi using ADS1x15 and MQ-2 sensors

OoD Minimum Anomaly Score GAN - Code for the Paper 'OMASGAN: Out-of-Distribution Minimum Anomaly Score GAN for Sample Generation on the Boundary'

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

TransPrompt - Towards an Automatic Transferable Prompting Framework for Few-shot Text Classification

Kaggle | 9th place (part of) solution for the Bristol-Myers Squibb – Molecular Translation challenge

This is the code repository for the paper A hierarchical semantic segmentation framework for computer-vision-based bridge column damage detection

The Simplest DCGAN Implementation

The official implementation of NeMo: Neural Mesh Models of Contrastive Features for Robust 3D Pose Estimation [ICLR-2021]. https://arxiv.org/pdf/2101.12378.pdf

A Unified Generative Framework for Various NER Subtasks.

Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution