tokenlearner-pytorch
Unofficial PyTorch implementation of TokenLearner by Ryoo et al. from Google AI (abs, pdf)
Installation
You can install TokenLearner via pip:
pip install tokenlearner-pytorch
Usage
The TokenLearner class is exposed by the tokenlearner_pytorch package. You can use this layer with a Vision Transformer, MLP-Mixer, or Video Vision Transformer, as done in the paper.
import torch
from tokenlearner_pytorch import TokenLearner
tklr = TokenLearner(S=8)
x = torch.rand(512, 32, 32, 3) # a batch of 32x32 images with 3 channels
y = tklr(x) # [512, 8, 3] -- 8 learned tokens per image
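If you want to combine TokenLearner with a ViT-style backbone, as mentioned above, one option is to reshape the patch tokens back into their spatial grid before applying the layer, so that all subsequent blocks only attend over the S learned tokens. The sketch below is illustrative only: the grid size, embedding dimension, and encoder layer are placeholders, and it assumes TokenLearner accepts an arbitrary channel dimension (its constructor only takes S):
import torch
import torch.nn as nn
from tokenlearner_pytorch import TokenLearner

B, H, W, C = 4, 16, 16, 192 # hypothetical patch grid and embedding size
tokens = torch.rand(B, H * W, C) # patch tokens from earlier ViT blocks
tklr = TokenLearner(S=8)
encoder = nn.TransformerEncoderLayer(d_model=C, nhead=4, batch_first=True)

grid = tokens.view(B, H, W, C) # restore the spatial grid TokenLearner expects
learned = tklr(grid) # [B, 8, C] -- only 8 tokens remain
out = encoder(learned) # later layers attend over 8 tokens instead of 256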
You can also use TokenLearner and TokenFuser together with Multi-head Self-Attention as done in the paper:
import torch
import torch.nn as nn
from tokenlearner_pytorch import TokenLearner, TokenFuser
mhsa = nn.MultiheadAttention(3, 1)
tklr = TokenLearner(S=8)
tkfr = TokenFuser(H=32, W=32, C=3, S=8)
x = torch.rand(512, 32, 32, 3) # a batch of images
y = tklr(x)
y = y.transpose(0, 1) # [8, 512, 3] -- nn.MultiheadAttention expects (seq, batch, embed) by default
y, _ = mhsa(y, y, y) # ignore attn weights
y = y.transpose(0, 1) # back to [512, 8, 3]
out = tkfr(y, x) # [512, 32, 32, 3]
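If you plan to stack this pattern, the three layers can be wrapped into a single module. The class below is only a sketch of one way to package them, not part of this library; it uses batch_first=True (PyTorch >= 1.9) to avoid the transposes in the example above:
import torch
import torch.nn as nn
from tokenlearner_pytorch import TokenLearner, TokenFuser

class TokenLearnerBlock(nn.Module):
    """Hypothetical wrapper: learn S tokens, mix them with MHSA, fuse them back."""
    def __init__(self, H, W, C, S, num_heads=1):
        super().__init__()
        self.tklr = TokenLearner(S=S)
        self.mhsa = nn.MultiheadAttention(C, num_heads, batch_first=True)
        self.tkfr = TokenFuser(H=H, W=W, C=C, S=S)

    def forward(self, x):         # x: [B, H, W, C]
        y = self.tklr(x)          # [B, S, C]
        y, _ = self.mhsa(y, y, y) # self-attention over the S learned tokens
        return self.tkfr(y, x)    # fuse back to [B, H, W, C]

block = TokenLearnerBlock(H=32, W=32, C=3, S=8)
out = block(torch.rand(512, 32, 32, 3)) # [512, 32, 32, 3]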
TODO
- Add support for temporal dimension T
- Implement TokenFuser with ViT
- Implement TokenFuser with ViViT
Contributions
If I've made any errors or you have any suggestions, feel free to open an Issue or PR. All contributions are welcome!
