CATs: Semantic Correspondence with Transformers

Last update: Dec 10, 2021

Overview

For more information, check out the paper on [arXiv].

Training code for different backbones, along with their evaluation results, will be added soon.

Network

Our model, CATs, is illustrated below:

[Figure: CATs network architecture]

Environment Settings

git clone https://github.com/SunghwanHong/CATs
cd CATs

conda create -n CATs python=3.6
conda activate CATs

pip install torch==1.8.0+cu111 torchvision==0.9.0+cu111 torchaudio==0.8.0 -f https://download.pytorch.org/whl/torch_stable.html
pip install -U scikit-image
pip install git+https://github.com/albumentations-team/albumentations
pip install tensorboardX termcolor timm tqdm requests pandas
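
After installation, it can help to confirm that every package resolved into the active environment before running anything. Below is a minimal sketch using only the standard library. The helper name `missing_packages` is hypothetical and not part of the repository, and the import names (for example `skimage` for scikit-image) are assumed to be the usual ones for the packages listed above.

```python
import importlib.util

# Import names for the packages installed above
# (scikit-image is imported as `skimage`).
REQUIRED = [
    "torch", "torchvision", "torchaudio", "skimage", "albumentations",
    "tensorboardX", "termcolor", "timm", "tqdm", "requests", "pandas",
]


def missing_packages(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

The check only looks the packages up and does not import them, so it does not verify versions or CUDA support.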

Evaluation

Download the pre-trained weights from Link.
All datasets are automatically downloaded into the directory specified by the argument datapath.

Result on SPair-71k: (PCK 49.9%)

  python test.py --pretrained "/path_to_pretrained_model/spair" --benchmark spair

Result on SPair-71k, feature backbone frozen: (PCK 42.4%)

  python test.py --pretrained "/path_to_pretrained_model/spair_frozen" --benchmark spair

Results on PF-PASCAL: (PCK 75.4%, 92.6%, 96.4%)

  python test.py --pretrained "/path_to_pretrained_model/pfpascal" --benchmark pfpascal

Results on PF-PASCAL, feature backbone frozen: (PCK 67.5%, 89.1%, 94.9%)

  python test.py --pretrained "/path_to_pretrained_model/pfpascal_frozen" --benchmark pfpascal
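
The four evaluation commands above differ only in the checkpoint directory and the benchmark name, so they can be generated from a small table. Below is a minimal sketch, assuming all checkpoints are unpacked under one common root directory. The helper `build_eval_commands` and its `weights_root` parameter are hypothetical and not part of the repository; only the `--pretrained` and `--benchmark` flags come from the commands above.

```python
import shlex

# (checkpoint directory name, benchmark) pairs taken from the commands above.
RUNS = [
    ("spair", "spair"),
    ("spair_frozen", "spair"),
    ("pfpascal", "pfpascal"),
    ("pfpascal_frozen", "pfpascal"),
]


def build_eval_commands(weights_root):
    """Build one `python test.py ...` argument list per checkpoint."""
    root = weights_root.rstrip("/")
    return [
        ["python", "test.py", "--pretrained", f"{root}/{ckpt}", "--benchmark", bench]
        for ckpt, bench in RUNS
    ]


if __name__ == "__main__":
    # "/path_to_pretrained_model" is the placeholder used above; replace it.
    for cmd in build_eval_commands("/path_to_pretrained_model"):
        print(" ".join(shlex.quote(part) for part in cmd))
```

Each argument list can also be passed to `subprocess.run(cmd, check=True)` from the repository root to run the evaluations one after another.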

BibTeX

If you find this research useful, please consider citing:

@misc{cho2021semantic,
      title={Semantic Correspondence with Transformers}, 
      author={Seokju Cho and Sunghwan Hong and Sangryul Jeon and Yunsung Lee and Kwanghoon Sohn and Seungryong Kim},
      year={2021},
      eprint={2106.02520},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
