LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

Last update: Oct 11, 2022

Related tags

Overview

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

This Repository contains the code on AVA of our ACM MM 2021 paper: LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

Installation

See INSTALL.md for details on installing the codebase, including requirement and environment settings

Data

For data preparation and setup, our LSTC strictly follows the processing of PySlowFast, See DATASET.md for details on preparing the data.

Run the code

We take SlowFast-ResNet50 as an example

train the model

python3 tools/run_net.py --cfg config/AVA/SLOWFAST_32x12_R50_LFB.yaml \
    AVA.FEATURE_BANK_PATH 'path/to/feature/bank/folder' \
    TRAIN.CHECKPOINT_FILE_PATH 'path/to/pretrained/backbone' \
    OUTPUT_DIR 'path/to/output/folder'

test the model

python3 tools/run_net.py --cfg config/AVA/SLOWFAST_32x12_R50_LFB.yaml \
    AVA.FEATURE_BANK_PATH 'path/to/feature/bank/folder' \
    OUTPUT_DIR 'path/to/output/folder' \
    TRAIN.ENABLE False \ 
    TEST.ENABLE True

If you want to start the DDP training from command line with torch.distributed.launch, please set start_method='cmd' in tools/run_net.py

Resource

The codebase provide following resources for fast training and validation

Pretrained backbone on Kinetics

backbone	dataset	model type	link
ResNet50	Kinetics400	Caffe2	Google Drive/Baidu Disk (Code: y1wl)
ResNet101	Kinetics600	Caffe2	Google Drive/Baidu Disk (Code: slde)

Extracted long term feature bank

backbone	feature bank (LMDB)	dimension
ResNet50	Google Drive	1280
ResNet101	Google Drive	2304

Checkpoint file

backbone	checkpoint	model type
ResNet50	Google Drive/Baidu Disk (Code: fi0s)	pytorch
ResNet101	Google Drive/Baidu Disk (Code: g63o)	pytorch

Acknowledgement

This codebase is built upon PySlowFast.

Citation

If you find this repository helps your research, please refer following paper

@InProceedings{Yuxi_2021_ACM,
  author = {Li, Yuxi and Zhang, Boshen and Li, Jian and Wang, Yabiao and Wang, Chengjie and Li, Jilin and Huang, Feiyue and Lin, Weiyao},
  title = {LSTC: Boosting Atomic Action Detection with Long-Short-Term Context},
  booktitle = {ACM Conference on Multimedia},
  month = {October},
  year = {2021}
}

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

Related tags

Overview

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

Installation

Data

Run the code

Resource

Pretrained backbone on Kinetics

Extracted long term feature bank

Checkpoint file

Acknowledgement

Citation

Owner

Tencent YouTu Research

This is a project of data parallel that running on NLP tasks.

This repository contains helper functions which can help you generate additional data points depending on your NLP task.

Code for "Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments".

Code for EmBERT, a transformer model for embodied, language-guided visual task completion.

Segmenter - Transformer for Semantic Segmentation

nlpcommon is a python Open Source Toolkit for text classification.

Automatic privilege escalation for misconfigured capabilities, sudo and suid binaries

Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"

AI-powered literature discovery and review engine for medical/scientific papers

Use PaddlePaddle to reproduce the paper：mT5: A Massively Multilingual Pre-trained Text-to-Text Transformer

Twewy-discord-chatbot - Build a Discord AI Chatbot that Speaks like Your Favorite Character

NLP, before and after spaCy

Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Data and code to support "Applied Natural Language Processing" (INFO 256, Fall 2021, UC Berkeley)

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

null

text to speech toolkit. 好用的中文语音合成工具箱，包含语音编码器、语音合成器、声码器和可视化模块。

100+ Chinese Word Vectors 上百种预训练中文词向量

NLP, Machine learning