Code for "Reconstructing 3D Human Pose by Watching Humans in the Mirror", CVPR 2021 oral

Last update: Dec 13, 2022

Related tags

Deep Learning Mirrored-Human

Overview

Reconstructing 3D Human Pose by Watching Humans in the Mirror

Qi Fang*, Qing Shuai*, Junting Dong, Hujun Bao, Xiaowei Zhou
CVPR 2021 Oral

^{The videos are from Youtube and Douyin. Please contact us for any copyright issue.}

News

We build a website for a fast preview of our dataset. The whole dataset will be released later.

Features

In this paper, we introduce the new task of reconstructing 3D human pose from a single image in which we can see the person and the person’s image through a mirror.

This implementation:

has the demo of our optimization-based approach implemented purely in PyTorch.
provides a method to estimate the surface normal of the mirror from vanishing points.
provides an annotator to label the mirror edges for the vanishing points.
can estimate the focal length of the Internet mirror images.

Installation

This repo has a close relation with EasyMocap. Please refer to our EasyMocap project for installation.

Demo

Download our zju-m-test.zip and run the following code:

# set the data path
data=<path_to_sample>/zju-m-demo
out=<path_to_sample>/zju-m-demo-output
# extract the video frames
python3 scripts/preprocess/extract_video.py ${data}
# Run demo on videos
python3 apps/demo/1v1p_mirror.py ${data} --out ${out} --vis_smpl --video

Mirrored-Human Dataset (Coming Soon)

Due to the license limitation, we cannot share the raw data directly. We are working hard to organize the Mirrored-Human dataset in terms of url links and timestamps.

See Build Your Internet Dataset if you can't wait for our release.

Annotator

We also provide the annotator metioned in our paper.

The first row shows that we label the edges of the mirror and calculate the vanishing point by the human body automaticly. The intrisic camera parameter can be calculated by this two vanishing points.

The second row shows that to obtain a more accurate vanishing points and camera parameters, we can label the parallel lines in the scene, for example the door, the grid in the ground, and the door.

See EasyMocap/apps/annotator for more instructions.

Build Custom Internet Dataset

See doc/internet.md for more instructions.

Build Custom Evaluation Dataset (Multi-View)

This part is provided for the researchers who want to:

capture more accurate human motion with multiple cameras and a mirror
build a different evaluation dataset

See doc/custom.md for more instructions.

Evaluation

To evaluate the reconstruction part in our paper, see doc/evaluation.md.

Contact

Please open an issue if you have any questions. We appreciate all contributions to improve our project.

If you find some videos that we didn't notice, please tell us.

Citation

@inproceedings{fang2021mirrored,
  title={Reconstructing 3D Human Pose by Watching Humans in the Mirror},
  author={Fang, Qi and Shuai, Qing and Dong, Junting and Bao, Hujun and Zhou, Xiaowei},
  booktitle={CVPR},
  year={2021}
}

Acknowledgement

This project is build on our EasyMocap. We also would like to thank Jianan Zhen and Yuhao Chen for their advice for the paper. Sincere thanks to the performers (Yuji Chen and Hao Xu) in the evaluation dataset and people who uploaded the mirror-human videos to the Internet.

Recommendations to other works from our group

Welcome to checkout our work on learning-based feature matching (LoFTR) and reconstruction (NeuralBody and NeuralRecon) in CVPR 2021.

Code for "Reconstructing 3D Human Pose by Watching Humans in the Mirror", CVPR 2021 oral

Related tags

Overview

Reconstructing 3D Human Pose by Watching Humans in the Mirror

News

Features

Installation

Demo

Mirrored-Human Dataset (Coming Soon)

Annotator

Build Custom Internet Dataset

Build Custom Evaluation Dataset (Multi-View)

Evaluation

Contact

Citation

Acknowledgement

Recommendations to other works from our group

Owner

ZJU3DV

FirmWire is a full-system baseband firmware emulation platform for fuzzing, debugging, and root-cause analysis of smartphone baseband firmwares

[CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning》.

Official implementation for "QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation" (CVPR 2022)

Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021

LSTM Neural Networks for Spectroscopic Studies of Type Ia Supernovae

Pre-Training 3D Point Cloud Transformers with Masked Point Modeling

Patch-Based Deep Autoencoder for Point Cloud Geometry Compression

Official Implementation of "Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras"

MINOS: Multimodal Indoor Simulator

A hifiasm fork for metagenome assembly using Hifi reads.

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CVPR 2018

The Noise Contrastive Estimation for softmax output written in Pytorch

Graph parsing approach to structured sentiment analysis.

TorchCV: A PyTorch-Based Framework for Deep Learning in Computer Vision

Unofficial implementation of "Coordinate Attention for Efficient Mobile Network Design"

This repository contains all data used for writing a research paper Multiple Object Trackers in OpenCV: A Benchmark, presented in ISIE 2021 conference in Kyoto, Japan.

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

Official code for 'Robust Siamese Object Tracking for Unmanned Aerial Manipulator' and offical introduction to UAMT100 benchmark

Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation (ICCV 2021)

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network