PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)

Last update: Jul 09, 2022

Overview

mlp-mixer-pytorch

Usage

import torch
from mlp_mixer import MLPMixer

model = MLPMixer(
    num_classes = 10,
    num_layers = 8,
    image_size = 256,
    patch_size = 32,
    channels = 3,
    hidden_dim = 512,
    tokens_hidden_dim = 256,
    channels_hidden_dim = 2048
)

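# One random RGB image in (batch, channels, height, width) layout
x = torch.randn(1, 3, 256, 256)
logits = model(x)  # shape: (1, 10), one score per class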
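
To make the shapes in the example above concrete: in the MLP-Mixer paper the image is cut into non-overlapping square patches, each patch is flattened and projected to hidden_dim, and the token-mixing MLP then operates across the patches. The sketch below only does that arithmetic. The helper name patch_grid is hypothetical and not part of mlp_mixer, and the divisibility check is an assumption about what a sensible implementation would enforce, not a statement about this repository.

```python
from typing import Tuple


def patch_grid(image_size: int, patch_size: int, channels: int) -> Tuple[int, int]:
    """Return (number of patches, flattened size of one patch).

    Hypothetical helper for illustration; not part of mlp_mixer.
    """
    # Assumption: the image must split evenly into square patches.
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    side = image_size // patch_size              # patches along one edge
    num_patches = side * side                    # sequence length seen by token mixing
    patch_dim = patch_size * patch_size * channels  # values per patch before projection
    return num_patches, patch_dim


num_patches, patch_dim = patch_grid(image_size=256, patch_size=32, channels=3)
print(num_patches, patch_dim)  # 64 3072
```

With the settings used above, the model would see 64 tokens, each projected from 3072 raw values to hidden_dim = 512.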

Preconfigured models

import torch
from mlp_mixer.models import (
    mlp_mixer_s16,
    mlp_mixer_s32,
    mlp_mixer_b16,
    mlp_mixer_b32,
    mlp_mixer_l16,
    mlp_mixer_l32,
    mlp_mixer_h14
)

# From the paper S16: patch_size=16, num_layers=8, hidden_dim=512, tokens_hidden_dim=256, channels_hidden_dim=2048
model = mlp_mixer_s16(
    num_classes = 10,
    image_size = 256,
    channels = 3
)

x = torch.randn(1, 3, 256, 256)
logits = model(x)  # shape: (1, 10)
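
For reference, the S16 values in the comment above match Table 1 of the MLP-Mixer paper. The sketch below lists that table's settings for all seven variants as a plain dictionary, using the same parameter names as MLPMixer. The name PAPER_CONFIGS is hypothetical and not part of mlp_mixer; whether the preconfigured functions in this repository use exactly these values is an assumption that should be checked against the source.

```python
# Hypothetical lookup table for illustration; not part of mlp_mixer.
# Values follow Table 1 of Tolstikhin et al. (2021).
_SIZES = {
    # size: (num_layers, hidden_dim, tokens_hidden_dim, channels_hidden_dim)
    "s": (8, 512, 256, 2048),
    "b": (12, 768, 384, 3072),
    "l": (24, 1024, 512, 4096),
    "h": (32, 1280, 640, 5120),
}

# (size, patch_size) pairs matching the preconfigured model names
_VARIANTS = [("s", 16), ("s", 32), ("b", 16), ("b", 32), ("l", 16), ("l", 32), ("h", 14)]

PAPER_CONFIGS = {}
for size, patch_size in _VARIANTS:
    num_layers, hidden_dim, tokens_hidden_dim, channels_hidden_dim = _SIZES[size]
    PAPER_CONFIGS[f"{size}{patch_size}"] = {
        "patch_size": patch_size,
        "num_layers": num_layers,
        "hidden_dim": hidden_dim,
        "tokens_hidden_dim": tokens_hidden_dim,
        "channels_hidden_dim": channels_hidden_dim,
    }

print(PAPER_CONFIGS["s16"])
```

If the assumption holds, passing PAPER_CONFIGS["s16"] together with num_classes, image_size, and channels to MLPMixer would be equivalent to calling mlp_mixer_s16.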

Install

pip install -r requirements.txt

Tests

pytest -ra

Owner

isaac

Senior Computer Vision Engineer @ BlackSky, Ph.D. student in Electrical Engineering at the University of Texas at San Antonio.
