3D Avatar Lip Synchronization from Speech (JALI-based face rigging)

Last update: Dec 20, 2022

Overview

visemenet-inference

Inference Demo of "VisemeNet-tensorflow"
- VisemeNet is an audio-driven, animator-centric speech animation model that drives a JALI or standard FACS-based face rig from input audio.
- The original repo is outdated, and setting up an environment to test the pretrained model is difficult. This code provides a clean inference module based on the original author's repo.

How to freeze graph

This repo does not need a Bazel build for the "freeze-graph" function.
Thanks to https://github.com/lighttransport/VisemeNet-infer for providing some examples.

Requirements

Python 3.6.x using "pyenv"
Tensorflow 1.1.0

Setup the envs and packages

# Install Virtualenv using pyenv
pyenv install 3.6.5
pyenv virtualenv 3.6.5 visemenet-freeze
pyenv activate visemenet-freeze

# Install packages
pip install tensorflow==1.1.0

Clone the repo

# Clone the VisemeNet repo and download the pretrained model
git clone https://github.com/yzhou359/VisemeNet_tensorflow.git
curl -L "https://www.dropbox.com/sh/7nbqgwv0zz8pbk9/AAAghy76GVYDLqPKdANcyDuba?dl=0" > pretrained_model.zip
unzip pretrained_model.zip -d VisemeNet_tensorflow/data/ckpt/pretrain_biwi/
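
If the unzip step fails, it can help to confirm that the downloaded file is really a zip archive, since a shared-link download may return a web page instead of the file. The following is a minimal sketch using only the Python standard library; the helper name `check_archive` is hypothetical and not part of this repo.

```python
import zipfile
from pathlib import Path


def check_archive(zip_path):
    """Return the archive's member names, or raise if the file is not a zip."""
    path = Path(zip_path)
    if not path.is_file():
        raise FileNotFoundError(f"{path} does not exist")
    if not zipfile.is_zipfile(str(path)):
        # Typical symptom of a download that returned HTML instead of the archive.
        raise ValueError(f"{path} is not a valid zip archive")
    with zipfile.ZipFile(str(path)) as archive:
        return archive.namelist()


# Example call (hypothetical helper, file name taken from the commands above):
# print(check_archive("pretrained_model.zip"))
```

A `ValueError` here suggests downloading the archive again before retrying the unzip command.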

Freeze Graph and Save as pb

# Freeze Graph
python freeze_graph.py
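
For readers unfamiliar with freezing, the general TensorFlow 1.x approach can be sketched as below. This is not the contents of this repo's freeze_graph.py. It is a minimal sketch that assumes the checkpoint directory contains a `.meta` file and a `checkpoint` state file, and the function name `freeze_checkpoint` and its parameters are hypothetical. The output node names for VisemeNet are not given here and would have to be looked up in the graph.

```python
def freeze_checkpoint(ckpt_dir, output_node_names, pb_path):
    """Write a frozen GraphDef (.pb) from the latest checkpoint in ckpt_dir.

    output_node_names is a list of graph node names to keep (assumption:
    the caller knows them; they are not documented in this README).
    Returns the number of nodes in the frozen graph.
    """
    # Imported here so the function can be defined without TensorFlow installed.
    import tensorflow as tf  # assumes TensorFlow 1.x

    ckpt_prefix = tf.train.latest_checkpoint(ckpt_dir)
    if ckpt_prefix is None:
        raise FileNotFoundError("no checkpoint found in " + ckpt_dir)

    graph = tf.Graph()
    with graph.as_default():
        # Rebuild the graph structure from the .meta file.
        saver = tf.train.import_meta_graph(ckpt_prefix + ".meta", clear_devices=True)
        with tf.Session(graph=graph) as sess:
            # Load the trained weights into the rebuilt graph.
            saver.restore(sess, ckpt_prefix)
            # Replace variables with constants holding their current values.
            frozen = tf.graph_util.convert_variables_to_constants(
                sess, graph.as_graph_def(), output_node_names
            )

    with tf.gfile.GFile(pb_path, "wb") as handle:
        handle.write(frozen.SerializeToString())
    return len(frozen.node)
```

Because this uses only the Python API, no Bazel build of TensorFlow's freeze_graph tool is involved.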

Model Inference

Colab Demo

This code provides simple, clean inference code with nothing unnecessary.
It is compatible with TensorFlow 2.x.

Requirements

Tensorflow 2.x
numpy
scipy
python_speech_features
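
Before running inference, a quick check that these packages are importable can save a failed run. The sketch below uses only the standard library. It assumes the import names match the package names listed above, and the helper name `missing_modules` is hypothetical.

```python
import importlib.util


def missing_modules(module_names):
    """Return the names from module_names that cannot be found for import."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


# Import names assumed to match the packages listed above.
REQUIRED = ["tensorflow", "numpy", "scipy", "python_speech_features"]
print("missing:", missing_modules(REQUIRED))
```

An empty list means all four modules were found. The check does not verify versions.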

How to run inference

import numpy as np
from inference import VisemeRegressor

pb_filepath = "./visemenet_frozen.pb"
wav_file_path = "./test_audio.wav"
out_txt_path = "./maya_viseme_outputs.txt"

viseme_regressor = VisemeRegressor(pb_filepath=pb_filepath)

viseme_outputs = viseme_regressor.predict_outputs(wav_file_path=wav_file_path)

np.savetxt(out_txt_path, viseme_outputs, '%.4f')
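
The saved text file can be read back for inspection without any extra dependencies, since `np.savetxt` writes one row per line with space-separated values by default. The following is a minimal sketch. The helper names are hypothetical, and what each row and column represents (for example, frames and rig parameters) is not stated here, so treat any interpretation as an assumption.

```python
def load_viseme_table(txt_path):
    """Parse a whitespace-delimited text file of floats into a list of rows."""
    rows = []
    with open(txt_path, "r") as handle:
        for line in handle:
            line = line.strip()
            if not line or line.startswith("#"):
                continue  # skip blank lines and np.savetxt-style header lines
            rows.append([float(value) for value in line.split()])
    return rows


def summarize(rows):
    """Return (number of rows, number of columns in the first row)."""
    return len(rows), (len(rows[0]) if rows else 0)


# Example call (path taken from the snippet above):
# print(summarize(load_viseme_table("./maya_viseme_outputs.txt")))
```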

Owner

Junhwan Jang
