A PyTorch library for Vision Transformers

Last update: Nov 28, 2022

Related tags

Deep Learning vformer

Overview

VFormer

A PyTorch library for Vision Transformers

Getting Started

Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contributing.

Comments

Add attention visualization methods
This article details different ways of visualizing a transformer's attention. It also talks about how such visualizations can aid in explainability of the models.

They also provide their code here.

We would like to have such visualization methods in the viz module.

good first issue
opened by NeelayS 7
Remove _Projection class

We can replace _Projection class with a one-liner if-else statement.

Should we replace it with if-else or should we keep the current implementation?

cc: @NeelayS @aditya-agrawal-30502 @alvanli

opened by abhi-glitchhg 6
Enhanced docstring

During the last PR (#45), I had to revert back because of compatibility issues

In this PR I have added some docstrings and Minor changes like changing variable names

this PR is the same as - #48 with edited title :)

@NeelayS

opened by abhi-glitchhg 3
Restructuring AbsolutePositionEmbedding class

AbsolutePositionEmbedding class was structured specifically for the PVT, but we can use it in other models too if we re-structure it properly, it should also support sinusoidal position embedding or a separate class for Sinusoidal embedding also works.
enhancement

opened by abhi-glitchhg 2
Add sharpness-aware optimizer

This paper describes how promoting smoothness with a recently proposed sharpness-aware optimizer substantially improves the performance of ViTs.

It would be good to have an implementation of this optimizer in our library. It would fit in the functional module.

A couple of PyTorch implementations are here and here.

opened by NeelayS 2
Documentation related to visualization methods

I have added some fixes for page breaks in #86.

Still, we need to enhance the docs for visualization methods.
We can include the license/copyright disclaimer for visualization methods in our license or have a separate file.

Additionally, we can add the sample outputs from these methods into the doc.

CC : @NeelayS @aditya-agrawal-30502 @alvanli
documentation enhancement good first issue

opened by abhi-glitchhg 1
[Paper] Visual Attention Network

paper - https://arxiv.org/abs/2202.09741 code- https://github.com/Visual-Attention-Network/VAN-Classification https://github.com/Visual-Attention-Network/VAN-Segmentation
Paper implementation

opened by abhi-glitchhg 0

Releases(v0.1.3)

v0.1.3(Jul 3, 2022)

Source code(tar.gz)
Source code(zip)
v0.1.2(Apr 7, 2022)

Source code(tar.gz)
Source code(zip)
v0.1.0(Feb 9, 2022)

First release of VFormer!
Source code(tar.gz)
Source code(zip)

Owner

Society for Artificial Intelligence and Deep Learning

GitHub Repository

Estimating Example Difficulty using Variance of Gradients

Estimating Example Difficulty using Variance of Gradients This repository contains source code necessary to reproduce some of the main results in the

48 Dec 26, 2022

Architecture Patterns with Python (TDD, DDD, EDM)

architecture-traning Architecture Patterns with Python (TDD, DDD, EDM) Chapter 5. 높은 기어비와 낮은 기어비의 TDD 5.2 도메인 계층 테스트를 서비스 계층으로 옮겨야 하는가? 도메인 계층 테스트 def

2 Mar 04, 2022

Intel® Neural Compressor is an open-source Python library running on Intel CPUs and GPUs

Intel® Neural Compressor targeting to provide unified APIs for network compression technologies, such as low precision quantization, sparsity, pruning, knowledge distillation, across different deep l

846 Jan 04, 2023

Pmapper is a super-resolution and deconvolution toolkit for python 3.6+

pmapper pmapper is a super-resolution and deconvolution toolkit for python 3.6+. PMAP stands for Poisson Maximum A-Posteriori, a highly flexible and a

8 Nov 06, 2022

Feedback is important: response-aware feedback mechanism for background based conversation

RFM The code for the paper: "Feedback is important: response-aware feedback mechanism for background based conversation." Requirements python 3.7 pyto

2 Sep 29, 2022

Image Captioning on google cloud platform based on iot

Image-Captioning-on-google-cloud-platform-based-on-iot - Image Captioning on google cloud platform based on iot

1 Jan 20, 2022

Implementation of "Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency"

Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency (ICCV2021) Paper Link: https://arxiv.org/abs/2107.11355 This implementation bui

32 Nov 17, 2022

[TNNLS 2021] The official code for the paper "Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement"

CSDNet-CSDGAN this is the code for the paper "Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement" Environment Preparing pyt

17 Nov 05, 2022

Deep-Learning-Book-Chapter-Summaries - Attempting to make the Deep Learning Book easier to understand.

Deep-Learning-Book-Chapter-Summaries This repository provides a summary for each chapter of the Deep Learning book by Ian Goodfellow, Yoshua Bengio an

1k Dec 27, 2022

Machine Learning Framework for Operating Systems - Brings ML to Linux kernel

KML: A Machine Learning Framework for Operating Systems & Storage Systems Storage systems and their OS components are designed to accommodate a wide v

186 Nov 24, 2022

Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark

This dataset is a large-scale dataset for moving object detection and tracking in satellite videos, which consists of 40 satellite videos captured by Jilin-1 satellite platforms.

87 Dec 22, 2022

This is the official implementation of "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval".

CORA This is the official implementation of the following paper: Akari Asai, Xinyan Yu, Jungo Kasai and Hannaneh Hajishirzi. One Question Answering Mo

59 Dec 28, 2022

Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"

Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition" Pre-trained Deep Convo

5 Nov 11, 2022

A PyTorch library for Vision Transformers

Related tags

Overview

VFormer

A PyTorch library for Vision Transformers

Getting Started

Comments

Add attention visualization methods

Remove _Projection class

Enhanced docstring

Restructuring AbsolutePositionEmbedding class

Add sharpness-aware optimizer

Documentation related to visualization methods

[Paper] Visual Attention Network

Releases(v0.1.3)

v0.1.3(Jul 3, 2022)

v0.1.2(Apr 7, 2022)

v0.1.0(Feb 9, 2022)

Owner

Society for Artificial Intelligence and Deep Learning

Estimating Example Difficulty using Variance of Gradients

Architecture Patterns with Python (TDD, DDD, EDM)

Intel® Neural Compressor is an open-source Python library running on Intel CPUs and GPUs

Pmapper is a super-resolution and deconvolution toolkit for python 3.6+

Feedback is important: response-aware feedback mechanism for background based conversation

Image Captioning on google cloud platform based on iot

Implementation of "Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency"

[TNNLS 2021] The official code for the paper "Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement"

Deep-Learning-Book-Chapter-Summaries - Attempting to make the Deep Learning Book easier to understand.

Machine Learning Framework for Operating Systems - Brings ML to Linux kernel

Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark

This is the official implementation of "One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval".

Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"

SpeechBrain is an open-source and all-in-one speech toolkit based on PyTorch.

Starter code for the ICCV 2021 paper, 'Detecting Invisible People'

TensorRT examples (Jetson, Python/C++)(object detection)

Efficient Two-Step Networks for Temporal Action Segmentation (Neurocomputing 2021)

Finetuner allows one to tune the weights of any deep neural network for better embeddings on search tasks

A rough implementation of the paper "A Steering Algorithm for Redirected Walking Using Reinforcement Learning"

Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations