SwinTransformerV2-TensorFlow

A TensorFlow implementation of SwinTransformerV2 by Microsoft Research Asia, based on their official implementation of SwinTransformerV1 and their paper on V2.

Paper on Version 2 (18/11/2021): [arXiv]

Paper on Version 1 (17/08/2021): [arXiv]

Features:

TensorFlow 2 implementation of version 1 and 2 of the SwinTransformer, a state-of-the-art backbone for many contemporaty tasks in computer vision. A brief overview of the architectural changes made in version 2:

A pre-norm configuration replaces the previous post-norm configuration, meant to improve training stability in larger models.
A scaled cosine attention replaces the dot product attention in V1, with a learnable scaler.
A continuous log-spaced relative position bias is used instead of the previous parametric table approach. This is implemented here as a small MLP network and a log transform on the relative coordinates bias.

Requirements:

numpy==1.21.4
tensorflow==2.7.0
tensorflow_addons==0.15.0

Getting started

Currently writing up.

License

This project is licensed under the MIT license.

Citation

@article{liu2021Swin,
  title={Swin Transformer: Hierarchical Vision Transformer using Shifted Windows},
  author={Liu, Ze and Lin, Yutong and Cao, Yue and Hu, Han and Wei, Yixuan and Zhang, Zheng and Lin, Stephen and Guo, Baining},
  journal={arXiv preprint arXiv:2103.14030},
  year={2021}
}

Implementation of SwinTransformerV2 in TensorFlow.

Related tags

Overview

SwinTransformerV2-TensorFlow

Features:

Requirements:

Getting started

License

Citation

Owner

Phan Nguyen

Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"

Awesome Remote Sensing Toolkit based on PaddlePaddle.

Image Lowpoly based on Centroid Voronoi Diagram via python-opencv and taichi

[ICCV'21] Pri3D: Can 3D Priors Help 2D Representation Learning?

City-Scale Multi-Camera Vehicle Tracking Guided by Crossroad Zones Code

Code for Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing(ICCV21)

Its a Plant Leaf Disease Detection System based on Machine Learning.

TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video

Used to record WKU's utility bills on a regular basis.

Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition

Seq2seq - Sequence to Sequence Learning with Keras

Text to image synthesis using thought vectors

HybridNets: End-to-End Perception Network

(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain

Organseg dags - The repository contains the codebase for multi-organ segmentation with directed acyclic graphs (DAGs) in CT.

FIRM-AFL is the first high-throughput greybox fuzzer for IoT firmware.

Using LSTM to detect spoofing attacks in an Air-Ground network

Implementation of the paper "Shapley Explanation Networks"

An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"

The official repo of the CVPR 2021 paper Group Collaborative Learning for Co-Salient Object Detection .