HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Last update: Dec 04, 2022

Related tags

Overview

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Official PyTroch implementation of HPRNet.

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation,
Nermin Samet, Emre Akbas,
Under review. (arXiv pre-print)

Highlights

HPRNet is a bottom-up, one-stage and hierarchical keypoint regression method for whole-body pose estimation.
HPRNet has the best performance among bottom-up methods for all the whole-body parts.
HPRNet achieves SOTA performance for the face (76.0 AP) and hand (51.2 AP) keypoint estimation.
Unlike two-stage methods, HPRNet predicts whole-body pose in a constant time independent of the number of people in an image.

COCO-WholeBody Keypoint Estimation Results

Model	Body AP	Foot AP	Face AP	Hand AP	Whole-body AP	Download
HPRNet (DLA)	55.2 / 57.1	49.1 / 50.7	74.6 / 75.4	47.0 / 48.4	31.5 / 32.7	model
HPRNet (Hourglass)	59.4 / 61.1	53.0 / 53.9	75.4 / 76.0	50.4 / 51.2	34.8 / 34.9	model

Results are presented without and with test time flip augmentation respectively.
All models are trained on COCO-WholeBody train2017 and evaluated on val2017.
The models can be downloaded directly from Google drive.

Installation

[Optional but recommended] create a new conda environment.
```
conda create --name HPRNet python=3.7
```
And activate the environment.
```
conda activate HPRNet
```

Clone the repo:

HPRNet_ROOT=/path/to/clone/HPRNet
git clone https://github.com/nerminsamet/HPRNet $HPRNet_ROOT

Install PyTorch 1.4.0:

conda install pytorch torchvision cudatoolkit=10.0 -c pytorch

Install the requirements:
```
pip install -r requirements.txt
```

Compile DCNv2 (Deformable Convolutional Networks):

cd $HPRNet_ROOT/src/lib/models/networks/DCNv2
./make.sh

Dataset preparation

Download the images (2017 Train, 2017 Val) from coco website.

Download train and val annotation files.

${COCO_PATH}
|-- annotations
    |-- coco_wholebody_train_v1.0.json
    |-- coco_wholebody_val_v1.0.json
|-- images
    |-- train2017
    |-- val2017

Evaluation and Training

You could find all the evaluation and training scripts in the experiments folder.
For evaluation, please download the pretrained models you want to evaluate and put them in HPRNet_ROOT/models/.
In the case that you don't have 4 GPUs, you can follow the linear learning rate rule to adjust the learning rate.
If the training is terminated before finishing, you can use the same command with --resume to resume training.

Acknowledgement

The numerical calculations reported in this paper were fully performed at TUBITAK ULAKBIM, High Performance and Grid Computing Center (TRUBA resources).

License

HPRNet is released under the MIT License (refer to the LICENSE file for details).

Citation

If you find HPRNet useful for your research, please cite our paper as follows:

N. Samet, E. Akbas, "HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation", arXiv, 2021.

BibTeX entry:

@misc{hprnet,
      title={HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation}, 
      author={Nermin Samet and Emre Akbas},
      year={2021}, 
}

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Related tags

Overview

HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation

Highlights

COCO-WholeBody Keypoint Estimation Results

Installation

Dataset preparation

Evaluation and Training

Acknowledgement

License

Citation

Owner

Nermin Samet

Code for the Lovász-Softmax loss (CVPR 2018)

OpenL3: Open-source deep audio and image embeddings

Tensorflow implementation of Fully Convolutional Networks for Semantic Segmentation

PyTorch reimplementation of the paper Involution: Inverting the Inherence of Convolution for Visual Recognition [CVPR 2021].

🤖 A Python library for learning and evaluating knowledge graph embeddings

Python lib to talk to pylontech lithium batteries (US2000, US3000, ...) using RS485

Toward Spatially Unbiased Generative Models (ICCV 2021)

This code finds bounding box of a single human mouth.

CAST: Character labeling in Animation using Self-supervision by Tracking

Mall-Customers-Segmentation - Customer Segmentation Using K-Means Clustering

This repo holds code for TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Reproduces ResNet-V3 with pytorch

Train a state-of-the-art yolov3 object detector from scratch!

Clockwork Variational Autoencoder

Collection of Docker images for ML/DL and video processing projects

A scientific and useful toolbox, which contains practical and effective long-tail related tricks with extensive experimental results

disentanglement_lib is an open-source library for research on learning disentangled representations.

Scripts of Machine Learning Algorithms from Scratch. Implementations of machine learning models and algorithms using nothing but NumPy with a focus on accessibility. Aims to cover everything from basic to advance.

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

Supervised multi-SNE (S-multi-SNE): Multi-view visualisation and classification