Deep Sketch-guided Cartoon Video Inbetweening

Last update: Dec 22, 2022

Related tags

Overview

Cartoon Video Inbetweening

Paper | DOI | Video

The source code of Deep Sketch-guided Cartoon Video Inbetweening by Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander, IEEE Transactions on Visualization and Computer Graphics, 2021.

Prerequisites

Linux or Windows
Python 3
CPU or NVIDIA GPU + CUDA CuDNN

Use the Pre-trained Models

You can download the pre-trained model here.

Run the following commands for evaluating the frame synthesis model and full model:

python eval_synthesis.py
python eval_full.py

The frame synthesis model takes img_0, img_1, ske_t as inputs and synthesizes img_t. The full model takes img_0, img_1, ske_t as inputs and interpolates five frames between img_0 and img_1.

Datasets

A dataset is a directory with the following structure:

dataset
    ├── frame
    │   └── ${clip_id}
    │       └──${image_id}.png
    ├── sketch
    │   └── ${clip_id}
    │       └──${image_id}.png
    └── dismap
        └── ${clip_id}
            └──${image_id}.npy

The sketch images can be generated by the script "sketch.py" and the distance maps can be generated by "dismap.py". Due to the copyright issue of the movie Spirited Away, we can not release our training dataset. You can generate your own dataset if you interest.

Training

Run the following command for training the frame synthesis model and full model:

python train_synthesis.py
python train_full.py

Before you train the full model, you must train the frame synthesis model first and use its parameters to initialize the full model.

Citing

If you find our work useful, please consider citing:

@article{li2021deep,
  author    = {Li, Xiaoyu and Zhang, Bo and Liao, Jing and Sander, Pedro},
  journal   = {IEEE Transactions on Visualization and Computer Graphics},
  year      = {2021},
  publisher = {IEEE}
}

Deep Sketch-guided Cartoon Video Inbetweening

Related tags

Overview

Cartoon Video Inbetweening

Paper | DOI | Video

Prerequisites

Use the Pre-trained Models

Datasets

Training

Citing

Owner

Xiaoyu Li

POPPY (Physical Optics Propagation in Python) is a Python package that simulates physical optical propagation including diffraction

Code release for The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification (TIP 2020)

Keras-1D-NN-Classifier

A Python reference implementation of the CF data model

AI-Fitness-Tracker - AI Fitness Tracker With Python

PyTorch implementation of the paper: Long-tail Learning via Logit Adjustment

Model that predicts the probability of a Twitter user being anti-vaccination.

The official repository for BaMBNet

SEC'21: Sparse Bitmap Compression for Memory-Efficient Training onthe Edge

Hard cater examples from Hopper ICLR paper

Explore extreme compression for pre-trained language models

[ICCV'21] PlaneTR: Structure-Guided Transformers for 3D Plane Recovery

you can add any codes in any language by creating its respective folder (if already not available).

Code for "Learning to Regrasp by Learning to Place"

Spatio-Temporal Entropy Model (STEM) for end-to-end leaned video compression.

Self-supervised learning algorithms provide a way to train Deep Neural Networks in an unsupervised way using contrastive losses

Learning Lightweight Low-Light Enhancement Network using Pseudo Well-Exposed Images

A pytorch reproduction of { Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation }.

particle tracking model, works with the ROMS output file(qck.nc, his.nc)

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement