Code for paper: Towards Tokenized Human Dynamics Representation

Last update: May 31, 2022

Overview

Video Tokneization

Codebase for video tokenization, based on our paper Towards Tokenized Human Dynamics Representation.

Prerequisites (tested under Python 3.8 and CUDA 11.1)

apt-get install ffmpeg  
pip install torch==1.8  
pip install torchvision  
pip install pytorch-lightning  
pip install pytorch-lightning-bolts  
pip install aniposelib wandb gym test-tube ffmpeg-python matplotlib easydict scikit-learn

Data Preparation

Make a directory besides this repo and name it aistplusplus
Download from AIST++ website until it looks like

├── annotations
│   ├── cameras
│   ├── ignore_list.txt
│   ├── keypoints2d
│   ├── keypoints3d
│   ├── motions
│   └── splits
└── video_list.txt

How to run

Write one configuration file, e.g., configs/tan.yaml.
Run python pretrain.py --cfg configs/tan.yaml with GPU, which will create a folder under logs for this run. Folder name specified by the NAME in configuration file. Then run python cluster.py --cfg configs/tan.yaml (CPU-only) and check results in demo.ipynb.
Or you can download and unzip my training result into logs folder from here.

Code for paper: Towards Tokenized Human Dynamics Representation

Related tags

Overview

Video Tokneization

Prerequisites (tested under Python 3.8 and CUDA 11.1)

Data Preparation

How to run

Owner

Kenneth Li

[IJCAI-2021] A benchmark of data-free knowledge distillation from paper "Contrastive Model Inversion for Data-Free Knowledge Distillation"

Some bravo or inspiring research works on the topic of curriculum learning.

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Sequence-tagging using deep learning

Prototype for Baby Action Detection and Classification

A MNIST-like fashion product database. Benchmark

Bayesian Meta-Learning Through Variational Gaussian Processes

TICC is a python solver for efficiently segmenting and clustering a multivariate time series

An Exact Solver for Semi-supervised Minimum Sum-of-Squares Clustering

maximal update parametrization (µP)

Pytorch implementation of MixNMatch

Wenzhou-Kean University AI-LAB

Segmentation vgg16 fcn - cityscapes

[NeurIPS 2021] “Improving Contrastive Learning on Imbalanced Data via Open-World Sampling”,

Source-to-Source Debuggable Derivatives in Pure Python

Multiband spectro-radiometric satellite image analysis with K-means cluster algorithm

Distributional Sliced-Wasserstein distance code

LLVM-based compiler for LightGBM gradient-boosted trees. Speeds up prediction by ≥10x.

Session-based Recommendation, CoHHN, price preferences, interest preferences, Heterogeneous Hypergraph, Co-guided Learning, SIGIR2022

A bare-bones TensorFlow framework for Bayesian deep learning and Gaussian process approximation