Unofficial implementation of One-Shot Free-View Neural Talking Head Synthesis

Last update: Dec 30, 2022

Related tags

Overview

face-vid2vid

Usage

Dataset Preparation

cd datasets
wget https://yt-dl.org/downloads/latest/youtube-dl -O youtube-dl
chmod a+rx youtube-dl
python load_videos.py --workers=8
cd ..

Pretrained Headpose Estimator

300W-LP, alpha 1, robust to image quality

Put hopenet_robust_alpha1.pkl here

Train

python train.py --batch_size=4 --gpu_ids=0,1,2,3 --num_epochs=100 (--ckp=10)

On 2080Ti, setting batch_size=4 makes up gpu memory

Evaluate

Reconstruction：

python evaluate.py --ckp=99 --source=r --driving=datasets/vox/test/id10280#NXjT3732Ekg#001093#001192.mp4

The first frame is used as source by default

Motion transfer：

python evaluate.py --ckp=99 --source=test.png --driving=datasets/vox/test/id10280#NXjT3732Ekg#001093#001192.mp4

Example after training for 7 days on 4 2080Ti:

Face Frontalization：

python evaluate.py --ckp=99 --source=f --driving=datasets/vox/train/id10192#S5yV10aCP7A#003200#003334.mp4

Acknowlegement

Thanks to NV, Imaginaire, AliaksandrSiarohin and DeepHeadPose

Unofficial implementation of One-Shot Free-View Neural Talking Head Synthesis

Related tags

Overview

face-vid2vid

Usage

Dataset Preparation

Pretrained Headpose Estimator

Train

Evaluate

Acknowlegement

Owner

worstcoder

disentanglement_lib is an open-source library for research on learning disentangled representations.

PyTorch code for the ICCV'21 paper: "Always Be Dreaming: A New Approach for Class-Incremental Learning"

State-of-the-art data augmentation search algorithms in PyTorch

PyTorch implementation of paper A Fast Knowledge Distillation Framework for Visual Recognition.

Generalized hybrid model for mode-locked laser diodes with an extended passive cavity

Generative Handwriting using LSTM Mixture Density Network with TensorFlow

The Power of Scale for Parameter-Efficient Prompt Tuning

Multi-Task Learning as a Bargaining Game

Torchserve server using a YoloV5 model running on docker with GPU and static batch inference to perform production ready inference.

This repository allows the user to automatically scale a 3D model/mesh/point cloud on Agisoft Metashape

Contenido del curso Bases de datos del DCC PUC versión 2021-2

A python3 tool to take a 360 degree survey of the RF spectrum (hamlib + rotctld + RTL-SDR/HackRF)

An Straight Dilated Network with Wavelet for image Deblurring

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery

Permute Me Softly: Learning Soft Permutations for Graph Representations

A PyTorch Implementation of ViT (Vision Transformer)

MogFace: Towards a Deeper Appreciation on Face Detection

More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval

Coded illumination for improved lensless imaging

Image Captioning using CNN and Transformers