Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Last update: Dec 07, 2022

Related tags

Deep Learning SELFY

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

This is the official implementation of the paper "Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition" by H.Kwon, M.Kim, S.Kwak, and M.Cho. For more information, checkout the project website and the paper on arXiv.

Environment:

Cuda: 9.0
gcc: 7.3.0
Python 3.6.8
PyTorch 1.0.1
TorchVison: 0.2.2
Spatial Correlation Sampler
Others: environment.yml

Anaconda environment setting

git clone https://github.com/arunos728/SELFY.git
cd selfy
conda env create -f environment.yml
conda activate selfy

Installing Correlation sampler

cd Pytorch-Correlation-extension
python setup.py install

# check whether SpatialCorrelationSampler is installed correctly.
python check.py forward
python check.py backward
python checkCorrelationSampler.py

Please check this repo for the detailed instructions.

Dataset preparation

Please refer to TSM repo for the detailed data preparation instructions.

File lists (.txt files in ./data) specify configurations of each video clips (path, #frames, class). We upload our Something-Something-V1 & V2 video file lists in ./data. The path of the file lists should be added into the scripts for training (or testing).

Training & Testing

For training SELFYNet on Something-Something, use the following command:

    ./scripts/train_SELFY_Something.sh

For testing your trained model on Something-Something, use the following command:

    ./scripts/test_SELFY_Something.sh

Citation

If you use this code or ideas from the paper for your research, please cite our paper:

@inproceedings{kwon2021learning,
  title={Learning self-similarity in space and time as generalized motion for video action recognition},
  author={Kwon, Heeseung and Kim, Manjin and Kwak, Suha and Cho, Minsu},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={13065--13075},
  year={2021}
}

Contact

Heeseung Kwon([email protected]), Manjin Kim([email protected])

Questions can also be left as issues in the repository. We will be happy to answer them.

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Related tags

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

Environment:

Anaconda environment setting

Installing Correlation sampler

Dataset preparation

Training & Testing

Citation

Contact

Owner

Code for the ICCV2021 paper "Personalized Image Semantic Segmentation"

PyTorch implementation for the paper Pseudo Numerical Methods for Diffusion Models on Manifolds

FastFCN: Rethinking Dilated Convolution in the Backbone for Semantic Segmentation.

AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE

Gym for multi-agent reinforcement learning

A gesture recognition system powered by OpenPose, k-nearest neighbours, and local outlier factor.

Official PyTorch implementation of "Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks" (AAAI 2022)

N-gram models- Unsmoothed, Laplace, Deleted Interpolation

Official implementation of ETH-XGaze dataset baseline

Faster Convex Lipschitz Regression

source code of Adversarial Feedback Loop Paper

Continual learning with sketched Jacobian approximations

Image based Human Fall Detection

Towhee is a flexible machine learning framework currently focused on computing deep learning embeddings over unstructured data.

Photo2cartoon - 人像卡通化探索项目 (photo-to-cartoon translation project)

Keras implementation of the GNM model in paper ’Graph-Based Semi-Supervised Learning with Nonignorable Nonresponses‘

[NeurIPS 2021] The PyTorch implementation of paper "Self-Supervised Learning Disentangled Group Representation as Feature"

This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML)

Official implementation of Deep Reparametrization of Multi-Frame Super-Resolution and Denoising

nnFormer: Interleaved Transformer for Volumetric Segmentation