PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Last update: Dec 26, 2022

Related tags

Deep Learning Dancing2Music

Overview

Dancing to Music

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Paper

Hsin-Ying Lee, Xiaodong Yang, Ming-Yu Liu, Ting-Chun Wang, Yu-Ding Lu, Ming-Hsuan Yang, Jan Kautz
Dancing to Music Neural Information Processing Systems (NeurIPS) 2019
[Paper] [YouTube] [Project] [Blog] [Supp]

Example Videos

Beat-Matching
1st row: generated dance sequences, 2nd row: music beats, 3rd row: kinematics beats

Multimodality
Generate various dance sequences with the same music and the same initial pose.

Long-Term Generation
Seamlessly generate a dance sequence with arbitrary length.

Photo-Realisitc Videos
Map generated dance sequences to photo-realistic videos.

Train Decomposition

python train_decomp.py --name Decomp

Train Composition

python train_comp.py --name Decomp --decomp_snapshot DECOMP_SNAPSHOT

Demo

python demo.py --decomp_snapshot DECOMP_SNAPSHOT --comp_snapshot COMP_SNAPSHOT --aud_path AUD_PATH --out_file OUT_FILE --out_dir OUT_DIR --thr THR

Flags
- aud_path: input .wav file
- out_file: location of output .mp4 file
- out_dir: directory of output frames
- thr: threshold based on motion magnitude
- modulate: whether to do beat warping
Example

python demo.py -decomp_snapshot snapshot/Stage1.ckpt --comp_snapshot snapshot/Stage2.ckpt --aud_path demo/demo.wav --out_file demo/out.mp4 --out_dir demo/out_frame

Citation

If you find this code useful for your research, please cite our paper:

@inproceedings{lee2019dancing2music,
  title={Dancing to Music},
  author={Lee, Hsin-Ying and Yang, Xiaodong and Liu, Ming-Yu and Wang, Ting-Chun and Lu, Yu-Ding and Yang, Ming-Hsuan and Kautz, Jan},
  booktitle={NeurIPS},
  year={2019}
}

License

Copyright (C) 2020 NVIDIA Corporation. All rights reserved. This work is made available under NVIDIA Source Code License (1-Way Commercial). To view a copy of this license, visit https://nvlabs.github.io/Dancing2Music/LICENSE.txt.

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Related tags

Overview

Dancing to Music

Paper

Example Videos

Train Decomposition

Train Composition

Demo

Citation

License

Owner

NVIDIA Research Projects

Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge

Face Synthetics dataset is a collection of diverse synthetic face images with ground truth labels.

Build a medical knowledge graph based on Unified Language Medical System (UMLS)

End-to-End Speech Processing Toolkit

LOFO (Leave One Feature Out) Importance calculates the importances of a set of features based on a metric of choice,

ilpyt: imitation learning library with modular, baseline implementations in Pytorch

Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions

EfficientMPC - Efficient Model Predictive Control Implementation

[TIP 2021] SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

Generating Anime Images by Implementing Deep Convolutional Generative Adversarial Networks paper

Unofficial implementation of PatchCore anomaly detection

Unofficial PyTorch implementation of Google AI's VoiceFilter system

The first dataset of composite images with rationality score indicating whether the object placement in a composite image is reasonable.

clustimage is a python package for unsupervised clustering of images.

Code for "Learning Structural Edits via Incremental Tree Transformations" (ICLR'21)

Adversarial Learning for Modeling Human Motion

PyTorch implementation of our paper How robust are discriminatively trained zero-shot learning models?

DeepSpamReview: Detection of Fake Reviews on Online Review Platforms using Deep Learning Architectures. Summer Internship project at CoreView Systems.

Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data

Official implementation for "QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation" (CVPR 2022)