PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Last update: Jan 03, 2023

Related tags

Overview

PySlowFast

PySlowFast is an open source video understanding codebase from FAIR that provides state-of-the-art video classification models with efficient training. This repository includes implementations of the following methods:

Introduction

The goal of PySlowFast is to provide a high-performance, light-weight pytorch codebase provides state-of-the-art video backbones for video understanding research on different tasks (classification, detection, and etc). It is designed in order to support rapid implementation and evaluation of novel video research ideas. PySlowFast includes implementations of the following backbone network architectures:

SlowFast
Slow
C2D
I3D
Non-local Network
X3D

Updates

We now support Multiscale Vision Transformers on Kinetics and ImageNet. See projects/mvit for more information.
We now support PyTorchVideo models and datasets. See projects/pytorchvideo for more information.
We now support X3D Models. See projects/x3d for more information.
We now support Multigrid Training for efficiently training video models. See projects/multigrid for more information.
PySlowFast is released in conjunction with our ICCV 2019 Tutorial.

License

PySlowFast is released under the Apache 2.0 license.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the PySlowFast Model Zoo.

Installation

Please find installation instructions for PyTorch and PySlowFast in INSTALL.md. You may follow the instructions in DATASET.md to prepare the datasets.

Quick Start

Follow the example in GETTING_STARTED.md to start playing video models with PySlowFast.

Visualization Tools

We offer a range of visualization tools for the train/eval/test processes, model analysis, and for running inference with trained model. More information at Visualization Tools.

Contributors

PySlowFast is written and maintained by Haoqi Fan, Yanghao Li, Bo Xiong, Wan-Yen Lo, Christoph Feichtenhofer.

Citing PySlowFast

If you find PySlowFast useful in your research, please use the following BibTeX entry for citation.

@misc{fan2020pyslowfast,
  author =       {Haoqi Fan and Yanghao Li and Bo Xiong and Wan-Yen Lo and
                  Christoph Feichtenhofer},
  title =        {PySlowFast},
  howpublished = {\url{https://github.com/facebookresearch/slowfast}},
  year =         {2020}
}

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Related tags

Overview

PySlowFast

Introduction

Updates

License

Model Zoo and Baselines

Installation

Quick Start

Visualization Tools

Contributors

Citing PySlowFast

Owner

Meta Research

SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements (CVPR 2021)

Robot Servers and Server Manager software for robo-gym

[ICCV'21] Neural Radiance Flow for 4D View Synthesis and Video Processing

Learning Chinese Character style with conditional GAN

GraPE is a Rust/Python library for high-performance Graph Processing and Embedding.

Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

Search Youtube Video and Get Video info

Implementation of "Selection via Proxy: Efficient Data Selection for Deep Learning" from ICLR 2020.

discovering subdomains, hidden paths, extracting unique links

PyTorch implementation of "Optimization Planning for 3D ConvNets"

Classify the disease status of a plant given an image of a passion fruit

BirdCLEF 2021 - Birdcall Identification 4th place solution

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

The Ludii general game system, developed as part of the ERC-funded Digital Ludeme Project.

An Unsupervised Graph-based Toolbox for Fraud Detection

TensorFlow (Python API) implementation of Neural Style

Exploring whether attention is necessary for vision transformers

Testing and Estimation of structural breaks in Stata

HashNeRF-pytorch - Pure PyTorch Implementation of NVIDIA paper on Instant Training of Neural Graphics primitives