[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Last update: Dec 30, 2022

Overview

[CVPR2022] Thin-Plate Spline Motion Model for Image Animation

Source code of the CVPR'2022 paper "Thin-Plate Spline Motion Model for Image Animation"

Example animation

PS: The paper trains the model for 100 epochs for a fair comparison. You can use more data and train for more epochs to get better performance.

Web demo for animation

Try the web demo for animation here:
Google Colab:

Pre-trained models

Installation

We support python3.(Recommended version is Python 3.9). To install the dependencies run:

pip install -r requirements.txt

YAML configs

There are several configuration files one for each dataset in the config folder named as config/dataset_name.yaml.

See description of the parameters in the config/taichi-256.yaml.

Datasets

MGif. Follow Monkey-Net.
TaiChiHD and VoxCeleb. Follow instructions from video-preprocessing.
TED-talks. Follow instructions from MRAA.

Training

To train a model on specific dataset run:

CUDA_VISIBLE_DEVICES=0,1 python run.py --config config/dataset_name.yaml --device_ids 0,1

A log folder named after the timestamp will be created. Checkpoints, loss values, reconstruction results will be saved to this folder.

Training AVD network

To train a model on specific dataset run:

CUDA_VISIBLE_DEVICES=0 python run.py --mode train_avd --checkpoint '{checkpoint_folder}/checkpoint.pth.tar' --config config/dataset_name.yaml

Checkpoints, loss values, reconstruction results will be saved to {checkpoint_folder}.

Evaluation on video reconstruction

To evaluate the reconstruction performance run:

CUDA_VISIBLE_DEVICES=0 python run.py --mode reconstruction --config config/dataset_name.yaml --checkpoint '{checkpoint_folder}/checkpoint.pth.tar'

The reconstruction subfolder will be created in {checkpoint_folder}. The generated video will be stored to this folder, also generated videos will be stored in png subfolder in loss-less '.png' format for evaluation. To compute metrics, follow instructions from pose-evaluation.

Image animation demo

notebook: demo.ipynb, edit the config cell and run for image animation.
python:

CUDA_VISIBLE_DEVICES=0 python demo.py --config config/vox-256.yaml --checkpoint checkpoints/vox.pth.tar --source_image ./source.jpg --driving_video ./driving.mp4

Acknowledgments

The main code is based upon FOMM and MRAA

Thanks for the excellent works!

Thanks iperov, this work has been integrated in DeepFaceLive

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Related tags

Overview

[CVPR2022] Thin-Plate Spline Motion Model for Image Animation

Example animation

Web demo for animation

Pre-trained models

Installation

YAML configs

Datasets

Training

Training AVD network

Evaluation on video reconstruction

Image animation demo

Acknowledgments

Owner

yoyo-nb

Pytorch Code for "Medical Transformer: Gated Axial-Attention for Medical Image Segmentation"

Generative Adversarial Networks(GANs)

A High-Performance Distributed Library for Large-Scale Bundle Adjustment

pytorch implementation of ABC : Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning

mlpack: a scalable C++ machine learning library --

Code of Periodic Activation Functions Induce Stationarity

Spectrum Surveying: Active Radio Map Estimation with Autonomous UAVs

Fully Convlutional Neural Networks for state-of-the-art time series classification

A toolkit for developing and comparing reinforcement learning algorithms.

This repo contains the code required to train the multivariate time-series Transformer.

A best practice for tensorflow project template architecture.

A Player for Kanye West's Stem Player. Sort of an emulator.

Approaches to modeling terrain and maps in python

A framework for multi-step probabilistic time-series/demand forecasting models

Package for working with hypernetworks in PyTorch.

A motion detection system with RaspberryPi, OpenCV, Python

Code, pre-trained models and saliency results for the paper "Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images".

Office source code of paper UniFuse: Unidirectional Fusion for 360$^\circ$ Panorama Depth Estimation

GANimation: Anatomically-aware Facial Animation from a Single Image (ECCV'18 Oral) [PyTorch]