PyTorch implementation of "Optimization Planning for 3D ConvNets"

Last update: Jan 12, 2022

Overview

Optimization-Planning-for-3D-ConvNets

Code for the ICML 2021 paper: Optimization Planning for 3D ConvNets.

Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei

1. Requirement

The provided codes have been tested with Python-3.9.5 & Pytorch-1.9.0 on four Tesla-V100s.

2. Project structure

├─ base_config             # Pre-set config file for each dataset
├─ dataset                 # Video lists (NOT provided) and code to load video data
├─ jpgs                    # Images for README
├─ layers                  # Custom network layers
├─ model                   # Network architectures
├─ record                  # Config file for each run
├─ utils                   # Basic functions
├─ extract_score_3d.py     # Main script to extract predicted score
├─ helpers.py              # Helper functions for main scripts
├─ merge_score.py          # Main script to merge scores from different clips
├─ train_3d.py             # Main script to launch a training using given strategy
├─ train_3d_op.py          # Main script to launch a searching of best strategy
└─ run.sh                  # Shell script for training-extracting-merging pipeline

3. Run the code

Pre-process the target dataset and put the lists in to the dataset folder. Codes in dataset/video_dataset.py can load three video formats (raw video, jpeg frames and video LMDB) and can be simply modified to support the custom format.
Make config file in the record folder. The config examples include op-*.yml for pre-searched strategy, kinetics-*.yml for simple strategy on Kinetics-400,
Run run.sh for the training-extracting-merging pipeline or replace train_3d.py with train_3d_op.py for searching the optimal strategy.

4. TO DO

Add more explainations and examples.

5. Contact

Please feel free to email to Zhaofan Qiu if you have any question regarding the paper or any suggestions for further improvements.

6. Citation

If you find this code helpful, thanks for citing our work as

@inproceedings{qiu2021optimization,
title={Optimization Planning for 3D ConvNets},
author={Qiu, Zhaofan and Yao, Ting and Ngo, Chong-Wah and Mei, Tao},
booktitle={Proceedings of the 38th International Conference on Machine Learning (ICML)},
publisher={PMLR},
year={2021}
}

Please also pay attention to the citations of the included networks/algorithms.

PyTorch implementation of "Optimization Planning for 3D ConvNets"

Related tags

Overview

Optimization-Planning-for-3D-ConvNets

Code for the ICML 2021 paper: Optimization Planning for 3D ConvNets.

Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei

1. Requirement

2. Project structure

3. Run the code

4. TO DO

5. Contact

6. Citation

Owner

Zhaofan Qiu

Auto-Lama combines object detection and image inpainting to automate object removals

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

[CVPR 2022] Structured Sparse R-CNN for Direct Scene Graph Generation

Official repository for HOTR: End-to-End Human-Object Interaction Detection with Transformers (CVPR'21, Oral Presentation)

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution (CVPR2021)

Multi-scale discriminator feature-wise loss function

JAXDL: JAX (Flax) Deep Learning Library

Learning to See by Looking at Noise

SoGCN: Second-Order Graph Convolutional Networks

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021

Automated Evidence Collection for Fake News Detection

This repository contains datasets and baselines for benchmarking Chinese text recognition.

A Survey on Deep Learning Technique for Video Segmentation

TorchOk - The toolkit for fast Deep Learning experiments in Computer Vision

Keras implementation of "One pixel attack for fooling deep neural networks" using differential evolution on Cifar10 and ImageNet

SCAAML is a deep learning framwork dedicated to side-channel attacks run on top of TensorFlow 2.x.

Open Source Light Field Toolbox for Super-Resolution

Use deep learning, genetic programming and other methods to predict stock and market movements

deep learning model with only python and numpy with test accuracy 99 % on mnist dataset and different optimization choices