Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.

Last update: Dec 12, 2022

Overview

aft-pytorch

Unofficial PyTorch implementation of the layers from "An Attention Free Transformer" by Zhai et al. (Apple Inc.).

Installation

You can install aft-pytorch via pip:

pip install aft-pytorch

Usage

You can import the AFT-Full or AFT-Simple layer (as described in the paper) from the package like so:

`AFTFull`

import torch
from aft_pytorch import AFTFull

layer = AFTFull(
    max_seqlen=20,
    dim=512,
    hidden_dim=64
)

# a batch of 32 sequences, each with 10 timesteps of 512 features
x = torch.rand(32, 10, 512)
y = layer(x) # [32, 10, 512]
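For intuition, here is a minimal sketch of the AFT-Full operation as the paper defines it: each output is `sigmoid(Q_t)` times a weighted average of the values, with weights `exp(K_t' + w[t, t'])`. This is not the package's source. The class name `AFTFullSketch`, the attribute names, and the output projection are assumptions made for illustration. It also shows why `max_seqlen` only needs to be an upper bound on the sequence length: the learned bias table is sliced down to the actual length.

```python
import torch
from torch import nn


class AFTFullSketch(nn.Module):
    """Illustrative AFT-Full layer (hypothetical; not the aft-pytorch code)."""

    def __init__(self, max_seqlen: int, dim: int, hidden_dim: int = 64):
        super().__init__()
        self.to_q = nn.Linear(dim, hidden_dim)
        self.to_k = nn.Linear(dim, hidden_dim)
        self.to_v = nn.Linear(dim, hidden_dim)
        self.project = nn.Linear(hidden_dim, dim)  # assumed output projection
        # Learned pairwise position bias w[t, t'], one scalar per position pair.
        self.wbias = nn.Parameter(torch.zeros(max_seqlen, max_seqlen))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        _, T, _ = x.shape
        q, k, v = self.to_q(x), self.to_k(x), self.to_v(x)  # each [B, T, H]
        w = torch.exp(self.wbias[:T, :T])                    # [T, T]
        # Subtract the per-feature max over time before exponentiating;
        # the factor cancels in the ratio below and avoids overflow.
        k = torch.exp(k - k.max(dim=1, keepdim=True).values)
        num = w @ (k * v)  # [T, T] @ [B, T, H] broadcasts to [B, T, H]
        den = w @ k        # [B, T, H]
        return self.project(torch.sigmoid(q) * num / den)    # [B, T, dim]


layer = AFTFullSketch(max_seqlen=20, dim=512, hidden_dim=64)
x = torch.rand(32, 10, 512)  # 10 timesteps, below max_seqlen=20
y = layer(x)
```

No attention matrix over query-key dot products is formed here. The only sequence-by-sequence term is the position bias, which is shared across the batch and across features.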

`AFTSimple`

import torch
from aft_pytorch import AFTSimple

layer = AFTSimple(
    max_seqlen=20,
    dim=512,
    hidden_dim=64
)

# a batch of 32 sequences, each with 10 timesteps of 512 features
x = torch.rand(32, 10, 512)
y = layer(x) # [32, 10, 512]
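As a rough illustration of what AFT-Simple computes, the paper drops the position bias entirely: the keys are normalized with a softmax over time, used to pool the values into a single context vector, and that context is gated per timestep by `sigmoid(Q_t)`. The sketch below follows that formula. The class name `AFTSimpleSketch` and the output projection are assumptions, and the package's actual implementation may differ.

```python
import torch
from torch import nn


class AFTSimpleSketch(nn.Module):
    """Illustrative AFT-Simple layer (hypothetical; not the aft-pytorch code)."""

    def __init__(self, dim: int, hidden_dim: int = 64):
        super().__init__()
        self.to_q = nn.Linear(dim, hidden_dim)
        self.to_k = nn.Linear(dim, hidden_dim)
        self.to_v = nn.Linear(dim, hidden_dim)
        self.project = nn.Linear(hidden_dim, dim)  # assumed output projection

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        q, k, v = self.to_q(x), self.to_k(x), self.to_v(x)  # each [B, T, H]
        weights = torch.softmax(k, dim=1)                    # normalize over time
        context = (weights * v).sum(dim=1, keepdim=True)     # [B, 1, H]
        # Every timestep shares the same context, gated by its own query.
        return self.project(torch.sigmoid(q) * context)      # [B, T, dim]


layer = AFTSimpleSketch(dim=512, hidden_dim=64)
x = torch.rand(32, 10, 512)
y = layer(x)
```

Because nothing in this variant depends on position, reordering the timesteps of the input simply reorders the outputs in the same way.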

These layers are plug-and-play with your existing networks and Transformers: you can swap out a self-attention layer for one of the layers in this package with minimal changes.
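One way such a swap might look is a Transformer block that receives its token-mixing layer as a constructor argument, so self-attention and AFT become interchangeable. The `Block` class, its `mixer` parameter, and the pre-norm layout are assumptions for illustration and are not part of this package. The example falls back to `nn.Identity` when `aft_pytorch` is not installed so that it still runs.

```python
import torch
from torch import nn


class Block(nn.Module):
    """Pre-norm Transformer block with an injected token mixer (hypothetical)."""

    def __init__(self, dim: int, mixer: nn.Module, mlp_ratio: int = 4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.mixer = mixer  # any module mapping [B, T, dim] -> [B, T, dim]
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(
            nn.Linear(dim, mlp_ratio * dim),
            nn.GELU(),
            nn.Linear(mlp_ratio * dim, dim),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x + self.mixer(self.norm1(x))   # token mixing with residual
        return x + self.mlp(self.norm2(x))  # feed-forward with residual


try:
    from aft_pytorch import AFTFull
    mixer = AFTFull(max_seqlen=20, dim=512, hidden_dim=64)
except ImportError:
    # Stand-in so the sketch runs without the package installed.
    mixer = nn.Identity()

block = Block(dim=512, mixer=mixer)
x = torch.rand(32, 10, 512)
out = block(x)  # same shape as the input: [32, 10, 512]
```

Passing the mixer in keeps the rest of the block unchanged, which is the "minimal changes" case: only the line that constructs the mixer differs between an attention-based and an AFT-based model.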

TODO

Add full AFT architecture
Add variants such as AFTConv and AFTLocal

Contributing

If you like this repo, please leave a star! If you have any amendments or suggestions, feel free to open a PR or an issue.

Credits

@misc{attention-free-transformer,
title = {An Attention Free Transformer},
author = {Shuangfei Zhai and Walter Talbott and Nitish Srivastava and Chen Huang and Hanlin Goh and Ruixiang Zhang and Josh Susskind},
year = {2021},
URL = {https://arxiv.org/pdf/2105.14103.pdf}
}

License

MIT

Owner

Rishabh Anand
