Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Last update: Jul 08, 2021

Related tags

Overview

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

This is a PyTorch implementation of the model described in our paper:

Z. Qi, S. Wang, C. Su, L. Su, W. Zhang, and Q. Huang. Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis. ACM MM 2020.

Dependencies

Pytorch 1.2.0
Cuda 9.2.148
Cudnn 7.6.2
Opencv-python 4.2.0.34
Python 3.6.9

Data

Dataset Prepare

Download the pre-trained concept detector weights from Baidu passward 'wv0e' or Google Grive and put them in folder weights/
Download the FCVID dataset from http://bigvid.fudan.edu.cn/FCVID/.
The annotation information of each dataset is provided in folder data/FCVID/video_labels.
Extract the video frames for each video and put the extracted frames in folder data/FCVID/frames/.

For ActivityNet dataset ( http://activity-net.org/. ) , we use the latest released version of the dataset (v1.3).

Train

python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --no_test

for other hyperparameters, please refer to opts.py file.

Test

Pretrained model weigths are avaiable in Baidu passward 'szlk' or Google Grive
Download the pre-trained weights and put them in folder results/
python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --resume_path pretrained_model/tdcmn_si_soa.pth --no_train --test_crop_number 1

Citation

Please cite our paper if you use this code in your own work:

@inproceedings{qi2020modeling,
  title={Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis},
  author={Qi, Zhaobo and Wang, Shuhui and Su, Chi and Su, Li and Zhang, Weigang and Huang, Qingming},
  booktitle={Proceedings of the 28th ACM International Conference on Multimedia},
  pages={3798--3806},
  year={2020}
}

Contcat

If you have any problem about our code, feel free to contact

[email protected]

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Related tags

Overview

Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Dependencies

Data

Dataset Prepare

Train

Test

Citation

Contcat

Owner

qzhb

This is the repository of the NeurIPS 2021 paper "Curriculum Disentangled Recommendation withNoisy Multi-feedback"

This project provides the proof of the uniqueness of the equilibrium and the global asymptotic stability.

Finite-temperature variational Monte Carlo calculation of uniform electron gas using neural canonical transformation.

🏎️ Accelerate training and inference of 🤗 Transformers with easy to use hardware optimization tools

PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentation.

Code repository for "Reducing Underflow in Mixed Precision Training by Gradient Scaling" presented at IJCAI '20

Implementation of Graph Convolutional Networks in TensorFlow

[NeurIPS 2020] Semi-Supervision (Unlabeled Data) & Self-Supervision Improve Class-Imbalanced / Long-Tailed Learning

Graph Neural Networks with Keras and Tensorflow 2.

Code for the paper "Learning-Augmented Algorithms for Online Steiner Tree"

Baselines for TrajNet++

Deep Networks with Recurrent Layer Aggregation

Implementation for On Provable Benefits of Depth in Training Graph Convolutional Networks

Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax

Eye-Blink-Counter - Python based Computer Vision project which counts how many time a person blinks

Neurons Dataset API - The official dataloader and visualization tools for Neurons Datasets.

Machine learning framework for both deep learning and traditional algorithms

Tutorial page of the Climate Hack, the greatest hackathon ever

Establishing Strong Baselines for TripClick Health Retrieval; ECIR 2022

Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.