This repository builds a basic vision transformer from scratch so that one beginner can understand the theory of vision transformer.

Last update: Dec 24, 2021

Related tags

Overview

vision-transformer-from-scratch

This repository includes several kinds of vision transformers from scratch so that one beginner can understand the theory of vision transformer easily. The basic transformer,the linformer transformer and the swin transformer are all trained and tested.

Requirements: PyTorch (>= 1.6.0); Python 3.6.9; Numpy (1.18.2); OpenCV ; Linformer;

Train the model: python main_train.py; In the main_train.py the basic transformer and the linformer can be selected.

Test the model: python test.py; In the main_train.py the basic transformer and the linformer can be selected.

The theory of vision transformer can reference the following document: https://towardsdatascience.com/implementing-visualttransformer-in-pytorch-184f9f16f632; https://www.kaggle.com/hannes82/vision-transformer-trained-from-scratch-pytorch;

Owner

GitHub Repository

Geometry-Aware Learning of Maps for Camera Localization (CVPR2018)

Geometry-Aware Learning of Maps for Camera Localization This is the PyTorch implementation of our CVPR 2018 paper "Geometry-Aware Learning of Maps for

321 Nov 26, 2022

AdamW optimizer for bfloat16 models in pytorch.

Image source AdamW optimizer for bfloat16 models in pytorch. Bfloat16 is currently an optimal tradeoff between range and relative error for deep netwo

8 Nov 20, 2022

Official implementation of Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models at NeurIPS 2021

Representer Point Selection via Local Jacobian Expansion for Classifier Explanation of Deep Neural Networks and Ensemble Models This repository is the

2 Dec 01, 2021

Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Official code for Continual Learning In Environments With Polynomial Mixing Times Continual Learning in Environments with Polynomial Mixing Times This

1 Dec 19, 2021

A Tensorflow implementation of the Text Conditioned Auxiliary Classifier Generative Adversarial Network for Generating Images from text descriptions

93 Aug 04, 2022

This repository builds a basic vision transformer from scratch so that one beginner can understand the theory of vision transformer.

Related tags

Overview

vision-transformer-from-scratch

Owner

Geometry-Aware Learning of Maps for Camera Localization (CVPR2018)

AdamW optimizer for bfloat16 models in pytorch.

Official implementation of Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models at NeurIPS 2021

Official code repository for Continual Learning In Environments With Polynomial Mixing Times

A Tensorflow implementation of the Text Conditioned Auxiliary Classifier Generative Adversarial Network for Generating Images from text descriptions

Customised to detect objects automatically by a given model file(onnx)

Code for "Diffusion is All You Need for Learning on Surfaces"

A tensorflow implementation of GCN-LPA

Official Pytorch implementation of 'RoI Tanh-polar Transformer Network for Face Parsing in the Wild.'

MWPToolkit is a PyTorch-based toolkit for Math Word Problem (MWP) solving.

CvT2DistilGPT2 is an encoder-to-decoder model that was developed for chest X-ray report generation.

SSD-based Object Detection in PyTorch

An Easy-to-use, Modular and Prolongable package of deep-learning based Named Entity Recognition Models.

Weakly supervised medical named entity classification

An LSTM based GAN for Human motion synthesis

Implementation of the paper Recurrent Glimpse-based Decoder for Detection with Transformer.

Implementation for Shape from Polarization for Complex Scenes in the Wild

Adversarial Color Enhancement: Generating Unrestricted Adversarial Images by Optimizing a Color Filter

Collection of NLP model explanations and accompanying analysis tools

Learning Open-World Object Proposals without Learning to Classify