Learning Representational Invariances for Data-Efficient Action Recognition

Last update: Nov 22, 2022

Overview

Learning Representational Invariances for Data-Efficient Action Recognition

Official PyTorch implementation for Learning Representational Invariances for Data-Efficient Action Recognition. We follow the code structure of MMAction2.

See the project page for more details.

Installation

We use PyTorch-1.6.0 with CUDA-10.2 and Torchvision-0.7.0.

Please refer to install.md for installation.

Data Preparation

First, please download human detection results and put them in the corresponding folder under data: UCF-101, HMDB-51, Kinetics-100.

Second, please refer to data_preparation.md to prepare raw frames of UCF-101 and HMDB-51. (Instructions of extracting frames from Kinetics-100 will be available soon.)

(Optional) You can download the pre-extracted ImageNet scores: UCF-101, HMDB-51.

Training

We use 8 RTX2080 Ti GPUs to run our experiments. You would need to adjust your training schedule accordingly if you have less GPUs. Please refer to here.

Supervised learning

PORT=${PORT:-29500}

python -m torch.distributed.launch \
--nproc_per_node=8 \
--master_port=$PORT \
tools/train.py \
$CONFIG \
--launcher pytorch ${@:3} \
--validate

You need to replace $CONFIG with the actual config file:

For supervised baseline, please use config files in configs/recognition/r2plus1d.
For strongly-augmented supervised learning, please use config files in configs/supervised_aug.

Semi-supervised learning

PORT=${PORT:-29500}

python -m torch.distributed.launch \
--nproc_per_node=8 \
--master_port=$PORT \
tools/train_semi.py \
$CONFIG \
--launcher pytorch ${@:3} \
--validate

You need to replace $CONFIG with the actual config file:

For single dataset semi-supervised learning, please use config files in configs/semi.
For cross-dataset semi-supervised learning, please use config files in configs/semi_both.

Testing

# Multi-GPU testing
./tools/dist_test.sh $CONFIG ${path_to_your_ckpt} ${num_of_gpus} --eval top_k_accuracy

# Single-GPU testing
python tools/test.py $CONFIG ${path_to_your_ckpt} --eval top_k_accuracy

NOTE: Do not use multi-GPU testing if you are currently using multi-GPU training.

Other details

Please see getting_started.md for the basic usage of MMAction2.

Acknowledgement

Codes are built upon MMAction2.

Learning Representational Invariances for Data-Efficient Action Recognition

Related tags

Overview

Learning Representational Invariances for Data-Efficient Action Recognition

Installation

Data Preparation

Training

Supervised learning

Semi-supervised learning

Testing

Other details

Acknowledgement

Owner

Virginia Tech Vision and Learning Lab

QueryInst: Parallelly Supervised Mask Query for Instance Segmentation

Clockwork Variational Autoencoder

A voice recognition assistant similar to amazon alexa, siri and google assistant.

QHack—the quantum machine learning hackathon

TorchOk - The toolkit for fast Deep Learning experiments in Computer Vision

Training DiffWave using variational method from Variational Diffusion Models.

Digan - Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks

The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"

Imbalanced Gradients: A Subtle Cause of Overestimated Adversarial Robustness

Create UIs for prototyping your machine learning model in 3 minutes

CIFAR-10 Photo Classification

Official pytorch implementation of "Scaling-up Disentanglement for Image Translation", ICCV 2021.

A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).

Keras code and weights files for popular deep learning models.

(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain

Optical machine for senses sensing using speckle and deep learning

Task-based end-to-end model learning in stochastic optimization

EfficientNetV2 implementation using PyTorch

Keras Implementation of The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation by (Simon Jégou, Michal Drozdzal, David Vazquez, Adriana Romero, Yoshua Bengio)

Creating Multi Task Models With Keras