an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch

Last update: Dec 22, 2022

Overview

revisiting-sepconv

This is a reference implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation [1] using PyTorch. Given two frames, it will make use of adaptive convolution [2] in a separable manner [3] to interpolate the intermediate frame. Should you be making use of our work, please cite our paper [1].

For the original SepConv, see: https://github.com/sniklaus/sepconv-slomo
For softmax splatting, please see: https://github.com/sniklaus/softmax-splatting

setup

The separable convolution layer is implemented in CUDA using CuPy, which is why CuPy is a required dependency. It can be installed using pip install cupy or alternatively using one of the provided binary packages as outlined in the CuPy repository.

If you plan to process videos, then please also make sure to have pip install moviepy installed.

usage

To run it on your own pair of frames, use the following command.

python run.py --model paper --one ./images/one.png --two ./images/two.png --out ./out.png

To run in on a video, use the following command.

python run.py --model paper --video ./videos/car-turn.mp4 --out ./out.mp4

For a quick benchmark using examples from the Middlebury benchmark for optical flow, run python benchmark.py. You can use it to easily verify that the provided implementation runs as expected.

video

license

Please refer to the appropriate file within this repository.

references

[1]  @inproceedings{Niklaus_WACV_2021,
         author = {Simon Niklaus and Long Mai and Oliver Wang},
         title = {Revisiting Adaptive Convolutions for Video Frame Interpolation},
         booktitle = {IEEE Winter Conference on Applications of Computer Vision},
         year = {2021}
     }

[2]  @inproceedings{Niklaus_ICCV_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Separable Convolution},
         booktitle = {IEEE International Conference on Computer Vision},
         year = {2017}
     }

[3]  @inproceedings{Niklaus_CVPR_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Convolution},
         booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
         year = {2017}
     }

an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch

Related tags

Overview

revisiting-sepconv

setup

usage

video

license

references

Owner

Simon Niklaus

Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks

scAR (single-cell Ambient Remover) is a package for data denoising in single-cell omics.

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Advantage Actor Critic (A2C): jax + flax implementation

A modular, open and non-proprietary toolkit for core robotic functionalities by harnessing deep learning

Local trajectory planner based on a multilayer graph framework for autonomous race vehicles.

Part-Aware Data Augmentation for 3D Object Detection in Point Cloud

Let's create a tool to convert Thailand budget from PDF to CSV.

curl-impersonate: A special compilation of curl that makes it impersonate Chrome & Firefox

Source Code and data for my paper titled Linguistic Knowledge in Data Augmentation for Natural Language Processing: An Example on Chinese Question Matching

Ensemble Learning Priors Driven Deep Unfolding for Scalable Snapshot Compressive Imaging [PyTorch]

Public Implementation of ChIRo from "Learning 3D Representations of Molecular Chirality with Invariance to Bond Rotations"

A Large-Scale Dataset for Spinal Vertebrae Segmentation in Computed Tomography

Simple codebase for flexible neural net training

tensorflow code for inverse face rendering

Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

ECLARE: Extreme Classification with Label Graph Correlations

GAN-based Matrix Factorization for Recommender Systems

Fast and exact ILP-based solvers for the Minimum Flow Decomposition (MFD) problem, and variants of it.

A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.