Realtime_Multi-Person_Pose_Estimation

Last update: Jan 05, 2023

Overview

Introduction

Multi Person PoseEstimation By PyTorch

Results

Require

Pytorch

Installation

git submodule init && git submodule update

Demo

Download converted pytorch model.
Compile the C++ postprocessing: cd lib/pafprocess; sh make.sh
python demo/picture_demo.py to run the picture demo.
python demo/web_demo.py to run the web demo.

Evalute

python evaluate/evaluation.py to evaluate the model on coco val2017 dataset.
It should have mAP 0.653 for the rtpose, previous rtpose have mAP 0.577 because we do left and right flip for heatmap and PAF for the evaluation. c

Main Results

model name	mAP	Inference Time
[original rtpose]	0.653	-

Download link: rtpose

Development environment

The code is developed using python 3.6 on Ubuntu 18.04. NVIDIA GPUs are needed. The code is developed and tested using 4 1080ti GPU cards. Other platforms or GPU cards are not fully tested.

Quick start

1. Preparation

1.1 Prepare the dataset

cd training; bash getData.sh to obtain the COCO 2017 images in /data/root/coco/images/, keypoints annotations in /data/root/coco/annotations/, make them look like this:

${DATA_ROOT}
|-- coco
    |-- annotations
        |-- person_keypoints_train2017.json
        |-- person_keypoints_val2017.json
    |-- images
        |-- train2017
            |-- 000000000009.jpg
            |-- 000000000025.jpg
            |-- 000000000030.jpg
            |-- ... 
        |-- val2017
            |-- 000000000139.jpg
            |-- 000000000285.jpg
            |-- 000000000632.jpg
            |-- ...

2. How to train the model

Modify the data directory in train/train_VGG19.py and python train/train_VGG19.py

Related repository

CVPR'17, Realtime Multi-Person Pose Estimation.

Network Architecture

testing architecture
training architecture

Contributions

All contributions are welcomed. If you encounter any issue (including examples of images where it fails) feel free to open an issue.

Citation

Please cite the paper in your publications if it helps your research:

@InProceedings{cao2017realtime,
  title = {Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields},
  author = {Zhe Cao and Tomas Simon and Shih-En Wei and Yaser Sheikh},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2017}
  }

Realtime_Multi-Person_Pose_Estimation

Related tags

Overview

Introduction

Results

Require

Installation

Demo

Evalute

Main Results

Development environment

Quick start

1. Preparation

1.1 Prepare the dataset

2. How to train the model

Related repository

Network Architecture

Contributions

Citation

Owner

tensorboy

Transport Mode detection - can detect the mode of transport with the help of features such as acceeration,jerk etc

CUDA Python Low-level Bindings

This repository contains the code for the binaural-detection model used in the publication arXiv:2111.04637

NeuralCompression is a Python repository dedicated to research of neural networks that compress data

An Artificial Intelligence trying to drive a car by itself on a user created map

Denoising Diffusion Probabilistic Models

pcnaDeep integrates cutting-edge detection techniques with tracking and cell cycle resolving models.

The CLRS Algorithmic Reasoning Benchmark

Multi-task Self-supervised Object Detection via Recycling of Bounding Box Annotations (CVPR, 2019)

SOTA easy to use PyTorch-based DL training library

Code for Talk-to-Edit (ICCV2021). Paper: Talk-to-Edit: Fine-Grained Facial Editing via Dialog.

This repository contains Prior-RObust Bayesian Optimization (PROBO) as introduced in our paper "Accounting for Gaussian Process Imprecision in Bayesian Optimization"

Code in PyTorch for the convex combination linear IAF and the Householder Flow, J.M. Tomczak & M. Welling

The repository contains reproducible PyTorch source code of our paper Generative Modeling with Optimal Transport Maps, ICLR 2022.

Pytorch implementation of the paper DocEnTr: An End-to-End Document Image Enhancement Transformer.

Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.

Framework that uses artificial intelligence applied to mathematical models to make predictions

A general framework for inferring CNNs efficiently. Reduce the inference latency of MobileNet-V3 by 1.3x on an iPhone XS Max without sacrificing accuracy.

A pytorch reproduction of { Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation }.

A library of extension and helper modules for Python's data analysis and machine learning libraries.