4D Human Body Capture from Egocentric Video via 3D Scene Grounding

Last update: Nov 08, 2022

Related tags

Overview

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

Installation:

Our method requires the same dependencies as SMPLify-X and OpenPose. We refer to the official implementation fo SMPLify-X and OpenPose for installation details.

Our method also needs the installation of Chamfer Pytorch to calculate the chamfer distnace for enforceing human-scene constraints

Data Preparation:

Step 1: Dump video frames with desired fps (30) with utils/dump_videos.py. Run utils/split_frames to segment videos into equally long subatom clips. Repack frames to videos with utils/pack_videos.py (This is for faster openpose I/O).

Step 2: Run openpose_call.py under openpose folder to get human body keypoints, then run utils/openpose_helper to rename keypoint.json and run utils/openpose_filter.py to keep the most confident human keypoints.

Step 3: Run Smplify-X model with specified focal length and data directory. This step may take up to several hours. For instance:

python3 smplifyx/main.py --config cfg_files/fit_smplx.yaml  --data_folder /home/miao/data/rylm/downsampled_frames/miao_mainbuilding_0-1 --output_folder /home/miao/data/rylm/downsampled_frames/miao_mainbuilding_0-1/body_gen --visualize="False" --model_folder ./models --vposer_ckpt ./vposer --part_segm_fn smplx_parts_segm.pkl --focal_length 694.0

Step 4: Run Colmap for to generate scene mesh and camera trajectory. This step make take up to several hours depneding on the complexity of the scene. Then Run utils/camerpose_helper and utils/pointscloud_helper.py to generate desired points cloud file and camera pose.

Joint Optimization with 3D Scene Context:

Run global_optimization.py to conduct temproal smoothing and enforce human-scene constraints:

python3 global_optimization.py '/home/miao/data/rylm/packed_data/miao_mainbuidling_0-1/body_gen' '/home/miao/data/rylm/packed_data/miao_mainbuidling_0-1/smoothed_body

The resulting data should be organized as following:

datafolder:
- videoname:
  - images: folder that contains all video frames
  - keypoints: folder that contains all body keypoints
  - body_gen: folder that contains all body mesh files:
  - smoothed_boyd: folder that contains all jointly-optimized body mesh files:
  - camera_pose.txt: text file that contains camera pose at each temporal footprint
  - meshed-poisson.ply: scene mesh file from dense reconstruction
  - camera.txt: text file that contains camera parameters
  - xyz.ply point cloud file. (use meash lab to convert .xyz file to .ply file)

Visualization in the World Coordinate:

Run global_vis.py to transform the body mesh in pivot coordinate to world coordinate. By default the viewpoint of open3d is the initial position camera trajectory. Setting bool flag to 'True' will resulting into a open3d viewpoint moving the same way as camera viewer.

python3 global_vis.py '/home/miao/data/rylm/downsampled_frames/miao_mainbuilding_0-1/' False

Visualization in the Egocentric Coordinate:

Run vis.py to view recosntrcuted body mesh on image plane.

python3 vis.py '/home/miao/data/rylm/segmented_data/miao_mainbuilding_0-1/'

Citation

If you find our code useful in your research, please use the following BibTeX entry for citation.

@inproceedings{liu20204d,
  title={4D Human Body Capture from Egocentric Video via 3D Scene Grounding},
  author={Liu, Miao and Yang, Dexin and Zhang, Yan and Cui, Zhaopeng and Rehg, James M and Tang, Siyu},
  booktitle={3DV},
  year={2021}
}

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

Related tags

Overview

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

Installation:

Data Preparation:

Joint Optimization with 3D Scene Context:

Visualization in the World Coordinate:

Visualization in the Egocentric Coordinate:

Citation

Owner

Miao Liu

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

This is the Pytorch implementation of Progressive Attentional Manifold Alignment.

This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

Use AI to generate a optimized stock portfolio

A simple approach to emable dense segmentation with ViT.

Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper

Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].

[CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning》.

A high-performance anchor-free YOLO. Exceeding yolov3~v5 with ONNX, TensorRT, NCNN, and Openvino supported.

Application of K-means algorithm on a music dataset after a dimensionality reduction with PCA

Auxiliary data to the CHIIR paper Searching to Learn with Instructional Scaffolding

Auto-Lama combines object detection and image inpainting to automate object removals

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Official PyTorch Implementation of SSMix (Findings of ACL 2021)

Tooling for GANs in TensorFlow

Machine Learning Toolkit for Kubernetes

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Code for the submitted paper Surrogate-based cross-correlation for particle image velocimetry

Pca-on-genotypes - Mini bioinformatics project - PCA on genotypes

Code, Models and Datasets for OpenViDial Dataset

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

Related tags

Overview

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

Installation:

Data Preparation:

Joint Optimization with 3D Scene Context:

Visualization in the World Coordinate:

Visualization in the Egocentric Coordinate:

Citation

Owner

Miao Liu

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

​ This is the Pytorch implementation of Progressive Attentional Manifold Alignment.

This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

Use AI to generate a optimized stock portfolio

A simple approach to emable dense segmentation with ViT.

Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper

Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].

[CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning》.

A high-performance anchor-free YOLO. Exceeding yolov3~v5 with ONNX, TensorRT, NCNN, and Openvino supported.

Application of K-means algorithm on a music dataset after a dimensionality reduction with PCA

Auxiliary data to the CHIIR paper Searching to Learn with Instructional Scaffolding

Auto-Lama combines object detection and image inpainting to automate object removals

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Official PyTorch Implementation of SSMix (Findings of ACL 2021)

Tooling for GANs in TensorFlow

Machine Learning Toolkit for Kubernetes

Official pytorch implementation of Active Learning for deep object detection via probabilistic modeling (ICCV 2021)

Code for the submitted paper Surrogate-based cross-correlation for particle image velocimetry

Pca-on-genotypes - Mini bioinformatics project - PCA on genotypes

Code, Models and Datasets for OpenViDial Dataset

This is the Pytorch implementation of Progressive Attentional Manifold Alignment.