MobileFormer

An implementation of MobileFormer proposed by Yinpeng Chen, Xiyang Dai et al.

Including

[1] Mobile-Former proposed in: 
                        Yinpeng Chen, Xiyang Dai et al., Mobile-Former: Bridging MobileNet and Transformer. 
                        arxiv.org/abs/2108.05895
[2] Dynamtic ReLU proposed in: 
                        Yinpeng Chen, Xiyang Dai et al., Dynamtic ReLU. 
                        arxiv.org/abs/2003.10027v2
[3] Lite-BottleNeck proposed in: 
                        Yunsheng Li, Yinpeng Chen et al., MicroNet: Improving Image Recognition with Extremely Low FLOPs. 
                        arxiv.org/abs/2108.05894v1
[4] Adam-W proposed in:
                        Ilya Loshchilov & Frank Hutter, Decoupled Weight Decay Regularization.
                        arxiv.org/abs/1711.05101v3
[5] Mixup proposed in:
                        Hongyi Zhang, Moustapha Cisse et al., Mixup: Beyond Empircal Risk Minimization.
                        arxiv.org/abs/1710.09412
[6] Multi-FocalLoss (not used), focal loss is proposed in:
                        Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollár, Focal Loss for Dense Object Detection.
                        arxiv.org/abs/1708.02002

Note

(1) Due to the expanded DW conv used in strided Mobile-Former blocks, 
    the out_channel should be divisible by expand_size of the next block.
(2) Adam-W and Mixup is embedded in train.py.
(3) Use run() in train.py to train('run') or search('search'). There is an example in the train.py.

'###### The '#'s #######'

'##### are aligned #####'

No pre-train parameters for now.

An implementation of MobileFormer

Related tags

Overview

MobileFormer

Including

Note

'###### The '#'s #######'

'##### are aligned #####'

Owner

slwang9353

PyTorch implementations of deep reinforcement learning algorithms and environments

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution

MPI Interest Group on Algorithms on 1st semester 2021

Codebase for Diffusion Models Beat GANS on Image Synthesis.

Implementation of Hire-MLP: Vision MLP via Hierarchical Rearrangement and An Image Patch is a Wave: Phase-Aware Vision MLP.

Code for paper PairRE: Knowledge Graph Embeddings via Paired Relation Vectors.

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

Efficient Online Bayesian Inference for Neural Bandits

A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors, CVPR 2021

DFFNet: An IoT-perceptive Dual Feature Fusion Network for General Real-time Semantic Segmentation

MADT: Offline Pre-trained Multi-Agent Decision Transformer

Notes taking website build with Docker + Django + React.

Ascend your Jupyter Notebook usage

Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading

Course content and resources for the AIAIART course.

Automatic library of congress classification, using word embeddings from book titles and synopses.

[NeurIPS 2021] Well-tuned Simple Nets Excel on Tabular Datasets

JAX-based neural network library