This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".

Last update: Dec 12, 2022

Related tags

Overview

AS-MLP architecture for Image Classification

Model Zoo

Image Classification on ImageNet-1K

Network	Resolution	Top-1 (%)	Params	FLOPs	Throughput (image/s)	model
AS-MLP-T	224x224	81.3	28M	4.4G	1047	onedrive
AS-MLP-S	224x224	83.1	50M	8.5G	619	onedrive
AS-MLP-B	224x224	83.3	88M	15.2G	455	onedrive

Usage

Install

Clone this repo:

git clone https://github.com/svip-lab/AS-MLP
cd AS-MLP

Create a conda virtual environment and activate it:

conda create -n asmlp python=3.7 -y
conda activate asmlp

Install CUDA==10.1 with cudnn7 following the official installation instructions
Install PyTorch==1.7.1 and torchvision==0.8.2 with CUDA==10.1:

conda install pytorch==1.7.1 torchvision==0.8.2 cudatoolkit=10.1 -c pytorch

Install timm==0.3.2:

pip install timm==0.3.2

Install cupy-cuda101:

pip install cupy-cuda101

Install Apex:

git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./

Install other requirements:

pip install opencv-python==4.4.0.46 termcolor==1.1.0 yacs==0.1.8

Evaluation

To evaluate a pre-trained AS-MLP on ImageNet val, run:

bash train_scripts/test.sh

Training from scratch

To train a AS-MLP on ImageNet from scratch, run:

bash train_scripts/train.sh

You can easily reproduce our results. Enjoy!

Throughput

To measure the throughput, run:

bash train_scripts/get_throughput.sh

Citation

If this project is helpful for you, you can cite our paper:

@article{Lian_2021_ASMLP,
  author = {Lian, Dongze and Yu, Zehao and Sun, Xing and Gao, Shenghua},
  title = {AS-MLP: An Axial Shifted MLP Architecture for Vision},
  journal={arXiv preprint arXiv:2107.08391},
  year = {2021}
}

Acknowledgement

The code is built upon Swin-Transformer

This is an official implementation for "AS-MLP: An Axial Shifted MLP Architecture for Vision".

Related tags

Overview

AS-MLP architecture for Image Classification

Model Zoo

Image Classification on ImageNet-1K

Usage

Install

Evaluation

Training from scratch

Throughput

Citation

Acknowledgement

Owner

SVIP Lab

SimpleDepthEstimation - An unified codebase for NN-based monocular depth estimation methods

PyTorch-based framework for Deep Hedging

Consecutive-Subsequence - Simple software to calculate susequence with highest sum

Wordplay, an artificial Intelligence based crossword puzzle solver.

The aim of the game, as in the original one, is to find a specific image from a group of different images of a person's face

The official repository for Deep Image Matting with Flexible Guidance Input

Pairwise Learning for Neural Link Prediction for OGB (PLNLP-OGB)

code for our paper "Source Data-absent Unsupervised Domain Adaptation through Hypothesis Transfer and Labeling Transfer"

A state-of-the-art semi-supervised method for image recognition

This project aims to be a handler for input creation and running of multiple RICEWQ simulations.

Code for all the Advent of Code'21 challenges mostly written in python

An off-line judger supporting distributed problem repositories

A simple and useful implementation of LPIPS.

This repo includes the supplementary of our paper "CEMENT: Incomplete Multi-View Weak-Label Learning with Long-Tailed Labels"

A booklet on machine learning systems design with exercises

Codes and models for the paper "Learning Unknown from Correlations: Graph Neural Network for Inter-novel-protein Interaction Prediction".

Fast Differentiable Matrix Sqrt Root

SOTA model in CIFAR10

The project is an official implementation of our paper "3D Human Pose Estimation with Spatial and Temporal Transformers".

Video-Captioning - A machine Learning project to generate captions for video frames indicating the relationship between the objects in the video