GitHub repository for "Improving Video Generation for Multi-functional Applications"

Last update: Dec 07, 2022

Improving Video Generation for Multi-functional Applications

Paper Link

For more information please refer to our homepage.

Requirements

TensorFlow 1.2.1
Python 2.7
ffmpeg

Data Format

Videos are stored as JPEGs of vertically stacked frames. Every frame must be at least 64x64 pixels, and each video contains between 16 and 32 frames. For an example dataset, see: http://carlvondrick.com/tinyvideo/#data
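The stacked-frame layout described above can be illustrated with a small sketch. It is not taken from this repository: the helper name `frame_boxes` is hypothetical, and it assumes all frames in a clip share the same height and that the frame height is known in advance.

```python
def frame_boxes(image_width, image_height, frame_height):
    """Return one (left, upper, right, lower) crop box per stacked frame.

    Hypothetical helper: assumes frames of equal height stacked top to bottom.
    """
    if frame_height < 64 or image_width < 64:
        raise ValueError("frames must be at least 64x64 pixels")
    if image_height % frame_height != 0:
        raise ValueError("image height is not a multiple of the frame height")
    num_frames = image_height // frame_height
    if not 16 <= num_frames <= 32:
        raise ValueError("expected 16 to 32 frames, got %d" % num_frames)
    # Frame i occupies rows [i * frame_height, (i + 1) * frame_height).
    return [(0, i * frame_height, image_width, (i + 1) * frame_height)
            for i in range(num_frames)]
```

Each box uses the (left, upper, right, lower) convention, so it could be passed to an image library's crop routine (for example Pillow's `Image.crop`) to recover individual frames from a clip.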

Training

python main_train.py

Important Parameters:

mode: one of 'generate', 'predict', 'bw2rgb', 'inpaint', depending on whether you want to generate videos, predict future frames, colorize videos, or do inpainting.
batch_size: recommended 64; for colorization use 32 to avoid memory issues.
root_dir: root directory of dataset
index_file: must be in root_dir, containing a list of all training data clips; path relative to root_dir.
experiment_name: name of experiment
output_every: output loss to stdout and write to tensorboard summary every xx steps.
sample_every: generate a visual sample every xx steps.
save_model_every: save the model every xx steps.
recover_model: if true, recover the saved model and continue training.
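As an illustration of the index_file parameter, the following sketch builds such a file by walking root_dir. It is not part of this repository: the function name `write_index_file`, the default file name `index.txt`, and the one-path-per-line layout are assumptions, so check them against what main_train.py actually expects.

```python
import os


def write_index_file(root_dir, index_name="index.txt", extension=".jpg"):
    """Write clip paths (relative to root_dir) into root_dir/index_name.

    Hypothetical helper: assumes one relative path per line.
    """
    clips = []
    for dirpath, _dirnames, filenames in os.walk(root_dir):
        for name in filenames:
            if name.lower().endswith(extension):
                full_path = os.path.join(dirpath, name)
                # Store the path relative to the dataset root.
                clips.append(os.path.relpath(full_path, root_dir))
    clips.sort()
    index_path = os.path.join(root_dir, index_name)
    with open(index_path, "w") as handle:
        for clip in clips:
            handle.write(clip + "\n")
    return clips
```

Calling `write_index_file("/path/to/dataset")` would list every JPEG clip under that directory; the path here is only a placeholder.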

Owner

Bernhard Kratzwald
