Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.

Last update: Dec 30, 2022

AI Choreographer: Music Conditioned 3D Dance Generation with AIST++ [ICCV-2021].

Overview

This package contains the model implementation and training infrastructure of our AI Choreographer.

Get started

Pull the code

git clone https://github.com/liruilong940607/mint --recursive

Note that --recursive is required here: it also clones the submodule (orbit) automatically.
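If the clone was made without --recursive, the submodule directory is left empty. A minimal sketch for checking this is shown here; the helper names are hypothetical, and it assumes the submodule paths are listed in a standard .gitmodules file at the repository root.

```python
import re
from pathlib import Path


def submodule_paths(repo_root):
    """Read submodule paths from .gitmodules (returns [] if the file is absent)."""
    gitmodules = Path(repo_root) / ".gitmodules"
    if not gitmodules.is_file():
        return []
    text = gitmodules.read_text()
    # Each submodule section contains a line of the form "path = <dir>".
    matches = re.findall(r"^\s*path\s*=\s*(.+)$", text, flags=re.MULTILINE)
    return [m.strip() for m in matches]


def empty_submodules(repo_root):
    """Return the submodule paths whose directories are missing or empty."""
    missing = []
    for rel in submodule_paths(repo_root):
        path = Path(repo_root) / rel
        if not path.is_dir() or not any(path.iterdir()):
            missing.append(rel)
    return missing
```

Calling empty_submodules(".") from the repository root returns the paths that still need to be fetched. If the list is not empty, git submodule update --init --recursive populates them without re-cloning.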

Install dependencies

conda create -n mint python=3.7
conda activate mint
conda install protobuf numpy
pip install tensorflow absl-py tensorflow-datasets librosa

sudo apt-get install libopenexr-dev
pip install --upgrade OpenEXR
pip install tensorflow-graphics tensorflow-graphics-gpu

git clone https://github.com/arogozhnikov/einops /tmp/einops
cd /tmp/einops/ && pip install . -U

git clone https://github.com/google/aistplusplus_api /tmp/aistplusplus_api
cd /tmp/aistplusplus_api && pip install -r requirements.txt && pip install . -U

Note: if you run into environment conflicts involving numpy, try pinning it with pip install numpy==1.20.
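When diagnosing such conflicts, it can help to see which versions actually ended up in the environment. The sketch below is one way to do that; installed_version and report are hypothetical helper names, and the package list in the usage note simply mirrors the install commands above.

```python
import importlib


def installed_version(module_name):
    """Return a module's __version__, "unknown" if it has none, or None if import fails."""
    try:
        module = importlib.import_module(module_name)
    except Exception:
        # A broken install (for example a binary mismatch) can raise errors
        # other than ImportError, so any failure is treated as "not usable".
        return None
    return getattr(module, "__version__", "unknown")


def report(module_names):
    """Map each module name to its version string or 'not importable'."""
    return {name: installed_version(name) or "not importable" for name in module_names}
```

For example, print(report(["numpy", "tensorflow", "librosa", "einops"])) lists the versions in the active conda environment, which makes it easier to see whether numpy was upgraded by a later pip install.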

Get the data

See the website

Get the checkpoint

Download the checkpoints from Google Drive here, and put them in the folder ./checkpoints/

Run the code

compile protocols

protoc ./mint/protos/*.proto --python_out=.

preprocess dataset into tfrecord

python tools/preprocessing.py \
    --anno_dir="/mnt/data/aist_plusplus_final/" \
    --audio_dir="/mnt/data/AIST/music/" \
    --split=train
python tools/preprocessing.py \
    --anno_dir="/mnt/data/aist_plusplus_final/" \
    --audio_dir="/mnt/data/AIST/music/" \
    --split=testval
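The two invocations above differ only in --split, so they can be driven from one small script. This is a sketch, not part of the repository: the function names are hypothetical, and only the flags shown above (--anno_dir, --audio_dir, --split) are used.

```python
import subprocess
import sys


def preprocessing_command(split, anno_dir, audio_dir):
    """Build the argv list for one tools/preprocessing.py run."""
    return [
        sys.executable,
        "tools/preprocessing.py",
        f"--anno_dir={anno_dir}",
        f"--audio_dir={audio_dir}",
        f"--split={split}",
    ]


def preprocess_all(anno_dir, audio_dir, splits=("train", "testval"), dry_run=False):
    """Run preprocessing for each split in order; stop at the first failure."""
    commands = [preprocessing_command(s, anno_dir, audio_dir) for s in splits]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))  # only show what would be executed
        else:
            subprocess.run(cmd, check=True)  # raises CalledProcessError on failure
    return commands
```

Running preprocess_all("/mnt/data/aist_plusplus_final/", "/mnt/data/AIST/music/", dry_run=True) from the repository root prints both commands without executing them, which is a cheap way to confirm the paths before starting the real run.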

run training

python trainer.py --config_path ./configs/fact_v5_deeper_t10_cm12.config --model_dir ./checkpoints

Note: if you run into out-of-memory errors, try reducing the batch_size in the config file.
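The batch size can be edited by hand, or with a small script when trying several values. The sketch below assumes the config is a plain-text file in which the setting appears as batch_size: followed by an integer; that layout is an assumption, so check the actual file first. set_batch_size is a hypothetical helper.

```python
import re


def set_batch_size(config_text, new_size):
    """Replace every `batch_size: <int>` value; return (new_text, replacements)."""
    # \b keeps the match from firing inside longer names such as eval_batch_size.
    pattern = re.compile(r"(\bbatch_size\s*:\s*)\d+")
    return pattern.subn(lambda m: f"{m.group(1)}{new_size}", config_text)
```

A typical use is to read ./configs/fact_v5_deeper_t10_cm12.config, call set_batch_size(text, 8), and write the result to a new file that is then passed via --config_path, leaving the original config untouched. A replacement count of 0 means the assumed layout does not match the file.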

run testing and evaluation

# caching the generated motions (seed included) to `./outputs`
python evaluator.py --config_path ./configs/fact_v5_deeper_t10_cm12.config --model_dir ./checkpoints
# calculate FIDs
python tools/calculate_scores.py
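For reference, the Fréchet distance behind FID-style scores is conventionally defined between two Gaussians fitted to feature sets, here written with mean and covariance (μ_r, Σ_r) for real motions and (μ_g, Σ_g) for generated ones. This is the standard definition only; which motion features tools/calculate_scores.py extracts, and whether it computes exactly this form, should be checked in the script itself.

```latex
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2
  + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2 \left( \Sigma_r \Sigma_g \right)^{1/2} \right)
```

Lower values mean the generated feature distribution is closer to the real one, and the score is zero when both means and covariances coincide.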

Citation

@inproceedings{li2021dance,
  title={AI Choreographer: Music Conditioned 3D Dance Generation with AIST++},
  author={Ruilong Li and Shan Yang and David A. Ross and Angjoo Kanazawa},
  booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
  year = {2021}
}

Owner

Google Research
