FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Last update: Jan 07, 2023

FrankMocap is an easy-to-use single view 3D motion capture system developed by Facebook AI Research (FAIR). It provides state-of-the-art 3D pose estimation outputs for body, hand, and body+hands in a single system. The core objective of FrankMocap is to democratize 3D human pose estimation technology, so that anyone (researchers, engineers, developers, artists, and others) can easily obtain 3D motion capture outputs from videos and images.

Btw, why the name FrankMocap? Our pipeline to integrate body and hand modules reminds us of Frankenstein's monster!

News:

[2020/10/09] We have improved openGL rendering speed. It's about 40% faster. (e.g., body module: 6fps -> 11fps)

Key Features

Body Motion Capture

Hand Motion Capture

Egocentric Hand Motion Capture

Whole body Motion Capture (body + hands)

Installation

See INSTALL.md

A Quick Start

Run body motion capture

# using a machine with a monitor to show output on screen
python -m demo.demo_bodymocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

# screenless mode (e.g., a remote server)
xvfb-run -a python -m demo.demo_bodymocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

Run hand motion capture

# using a machine with a monitor to show outputs on screen
python -m demo.demo_handmocap --input_path ./sample_data/han_hand_short.mp4 --out_dir ./mocap_output

# screenless mode (e.g., a remote server)
xvfb-run -a python -m demo.demo_handmocap --input_path ./sample_data/han_hand_short.mp4 --out_dir ./mocap_output

Run whole body motion capture

# using a machine with a monitor to show outputs on screen
python -m demo.demo_frankmocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

# screenless mode (e.g., a remote server)
xvfb-run -a python -m demo.demo_frankmocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

Note:
- The commands above use OpenGL by default. If it does not work, you may try the alternative renderers (PyTorch3D or OpenDR).
- See the readme of each module for details.
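
To run several of the demos above from a script, the command lines can be assembled programmatically. The following is a minimal sketch, not part of FrankMocap itself: the helper `build_command`, the short mode keys (`body`, `hand`, `whole_body`), and the `extra_args` pass-through are hypothetical names introduced here for illustration. Only the module names, the `--input_path` and `--out_dir` options, and the `xvfb-run -a` prefix come from the commands shown above.

```python
import shlex
from typing import List, Optional, Sequence

# Module names are taken from the Quick Start commands; the short keys
# ("body", "hand", "whole_body") are labels invented for this sketch.
DEMO_MODULES = {
    "body": "demo.demo_bodymocap",
    "hand": "demo.demo_handmocap",
    "whole_body": "demo.demo_frankmocap",
}


def build_command(
    mode: str,
    input_path: str,
    out_dir: str,
    headless: bool = False,
    extra_args: Optional[Sequence[str]] = None,
) -> List[str]:
    """Return the argument list for one demo run (hypothetical helper)."""
    if mode not in DEMO_MODULES:
        raise ValueError(f"unknown mode {mode!r}; expected one of {sorted(DEMO_MODULES)}")

    cmd: List[str] = []
    if headless:
        # Screenless mode: wrap the call in xvfb-run, as in the commands above.
        cmd += ["xvfb-run", "-a"]
    cmd += [
        "python", "-m", DEMO_MODULES[mode],
        "--input_path", input_path,
        "--out_dir", out_dir,
    ]
    if extra_args:
        # Any further options (see each module's readme) are passed through unchanged.
        cmd += list(extra_args)
    return cmd


if __name__ == "__main__":
    samples = [
        ("body", "./sample_data/han_short.mp4"),
        ("hand", "./sample_data/han_hand_short.mp4"),
        ("whole_body", "./sample_data/han_short.mp4"),
    ]
    for mode, video in samples:
        # Print the shell-quoted command instead of executing it.
        print(shlex.join(build_command(mode, video, "./mocap_output", headless=True)))
```

To actually execute a command, the returned list could be passed to `subprocess.run(cmd, check=True)` from the repository root, assuming the installation steps in INSTALL.md have been completed.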

Joint Order

See joint_order

Body Motion Capture Module

See run_bodymocap

Hand Motion Capture Module

See run_handmocap

Whole Body Motion Capture Module (Body + Hand)

See run_totalmocap

License

CC-BY-NC 4.0. See the LICENSE file.

References

FrankMocap is based on the following research outputs:

@article{rong2020frankmocap,
  title={FrankMocap: Fast Monocular 3D Hand and Body Motion Capture by Regression and Integration},
  author={Rong, Yu and Shiratori, Takaaki and Joo, Hanbyul},
  journal={arXiv preprint arXiv:2008.08324},
  year={2020}
}

@article{joo2020eft,
  title={Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation},
  author={Joo, Hanbyul and Neverova, Natalia and Vedaldi, Andrea},
  journal={arXiv preprint arXiv:2004.03686},
  year={2020}
}

FrankMocap leverages many excellent open-source projects shared by the research community:
- SMPL, SMPLX
- Detectron2
- PyTorch3D (for rendering)
- OpenDR (for rendering)
- SPIN (for body module)
- 100DOH (for hand detection)
- lightweight-human-pose-estimation (for body detection)

Owner

Facebook Research
