Tensorflow Implementation of ECCV'18 paper: Multimodal Human Motion Synthesis

Last update: Oct 02, 2022

Overview

MT-VAE for Multimodal Human Motion Synthesis

This is the code for ECCV 2018 paper MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics by Xinchen Yan, Akash Rastogi, Ruben Villegas, Kalyan Sunkavalli, Eli Shechtman, Sunil Hadap, Ersin Yumer, Honglak Lee.

Please follow the instructions to run the code.

Requirements

MT-VAE requires or works with

Mac OS X or Linux
NVIDIA GPU

Installing Dependency

Install TensorFlow
Note: this implementation has been tested with TensorFlow 1.3.

Data Preprocessing

For Human3.6M dataset, please download the pre-processed dataset.

bash prep_human36m_joints.sh

Disclaimer: Please check the license of Human3.6M dataset if you download this preprocessed version.

Training (MT-VAE)

If you want to train the MT-VAE human motion generator, please run the following script (usually it takes 1 day with a single Titan GPU).

bash demo_human36m_trainMTVAE.sh

Alternatively, you can download the pre-trained MT-VAE model, please run the following script.

bash prep_human36m_model.sh

Motion Synthesis Using Pre-trained MT-VAE Model

Please run the following command to generate multiple diverse human motion given initial motion.

bash demo_human36m_inferMTVAE.sh

Motion Analogy-making Using Pre-trained MT-VAE Model

Please run the following command to execute motion analogy-making.

bash demo_human36m_analogyMTVAE.sh

Hierchical Video Synthesis Using Pre-trained Image Generation Model

Please download full Human3.6M videos into the workspace/Human3.6M/ folder.
We use a pre-trained model from the ICML 2017 HierchVid Repository. Please run the following command for image synthesis given generated motion sequence.

CUDA_VISIBLE_DEVICE=0 python h36m_hierach_gensample.py

Disclaimer: Please double check the license in that repository and cite HierchVid paper when use.

Citation

If you find this useful, please cite our work as follows:

@inproceedings{yan2018mt,
  title={MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics},
  author={Yan, Xinchen and Rastogi, Akash and Villegas, Ruben and Sunkavalli, Kalyan and Shechtman, Eli and Hadap, Sunil and Yumer, Ersin and Lee, Honglak},
  booktitle={European Conference on Computer Vision},
  pages={276--293},
  year={2018},
  organization={Springer}
}

Acknowledgements

We would like to thank the amazing developers and the open-sourcing community. Our implementation has especially been benefited from the following excellent repositories:

Attribute2Image: https://github.com/xcyan/eccv16_attr2img
TensorFlow-PTN: https://github.com/tensorflow/models/tree/master/research/ptn
VideoGAN: https://github.com/cvondrick/videogan
MoCoGAN: https://github.com/sergeytulyakov/mocogan
HierchVid: https://github.com/rubenvillegas/icml2017hierchvid
Sketch-RNN: https://github.com/tensorflow/magenta/tree/master/magenta/models/sketch_rnn
VRNN: https://github.com/jych/nips2015_vrnn
SVG: https://github.com/edenton/svg

Tensorflow Implementation of ECCV'18 paper: Multimodal Human Motion Synthesis

Related tags

Overview

MT-VAE for Multimodal Human Motion Synthesis

Requirements

Installing Dependency

Data Preprocessing

Training (MT-VAE)

Motion Synthesis Using Pre-trained MT-VAE Model

Motion Analogy-making Using Pre-trained MT-VAE Model

Hierchical Video Synthesis Using Pre-trained Image Generation Model

Citation

Acknowledgements

Owner

Xinchen Yan

Machine Learning Privacy Meter: A tool to quantify the privacy risks of machine learning models with respect to inference attacks, notably membership inference attacks

Hamiltonian Dynamics with Non-Newtonian Momentum for Rapid Sampling

Code for Mesh Convolution Using a Learned Kernel Basis

Perturb-and-max-product: Sampling and learning in discrete energy-based models

Finding Donors for CharityML

Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.

Birthday-problem - The birthday problem asks for the probability that, in a set of n randomly chosen people, at least two will share a birthday

CN24 is a complete semantic segmentation framework using fully convolutional networks

Rethinking Transformer-based Set Prediction for Object Detection

The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation

Few-shot NLP benchmark for unified, rigorous eval

Reimplementation of the paper "Attention, Learn to Solve Routing Problems!" in jax/flax.

Minimal implementation of Denoised Smoothing: A Provable Defense for Pretrained Classifiers in TensorFlow.

Resources for the Ki testnet challenge

we propose EfficientDerain for high-efficiency single-image deraining

Resilience from Diversity: Population-based approach to harden models against adversarial attacks

Perspective: Julia for Biologists

Gesture Volume Control Using OpenCV and MediaPipe

pytorch implementation of dftd2 & dftd3

Experiments with Fourier layers on simulation data.