A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. Reproduced the 67.8% Top1 Acc on ImageNet.

Last update: Nov 09, 2022

Related tags

Deep Learning pytorch-SimSiam

Overview

Exploring simple siamese representation learning

This is a PyTorch re-implementation of the SimSiam paper on ImageNet dataset. The results match that reported in the paper. The implementation is based on the codes of MOCO.

Unsupervised pre-training

To run unsupervised pre-training on ImageNet,

sh train_simsiam.sh

This is to do the unsupervised pre-training for 100 epochs. Please modify the path to your ImageNet data folder.

Note 1: I try to follow the setting in the paper, which is bs=512 and lr=0.1 on 8-GPU, but somehow I can not fit it. So I used the max batch_size that I can fit (432) while kept the lr unchaged (0.1).

Note 2: In pre-training, I didn't fix the lr of prediction MLP. According to the paper (Table. 1), fixing the lr of prediction MLP can give slightly improvements (67.7% -> 68.1%). You can try it if interested.

Linear evaluation

To run linear evaluation,

sh train_lincls.sh

The linear evaluation is done using NVIDIA LARC optimizer by setting trus_coefficient=0.001 and clip=False. The batch size is 4096.

Note: I first followed the setting in the paper, which is Lr=0.32 (0.02*4096/256). But I can only got a result of 66.0%. Then I increased the learning rate to Lr=1.6 (0.1*4096.256) and achieved the result of 67.8%. The results and models are given below.

SimSiam	pretrained batchsize	lincls Lr	Top-1 Acc
Reported	512	0.32	67.7%
Reproduced	432 (Model)	1.6	67.8% (Model)
Reproduced	432	0.32	66.0%

Acknowledgement

Thank Xinlei for his help on some implementation details.

A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. Reproduced the 67.8% Top1 Acc on ImageNet.

Related tags

Overview

Exploring simple siamese representation learning

Unsupervised pre-training

Linear evaluation

Acknowledgement

Owner

Taojiannan Yang

This is a repository of our model for weakly-supervised video dense anticipation.

Dynamic Token Normalization Improves Vision Transformers

This is the implementation of "SELF SUPERVISED REPRESENTATION LEARNING WITH DEEP CLUSTERING FOR ACOUSTIC UNIT DISCOVERY FROM RAW SPEECH" submitted to ICASSP 2022

Fake-user-agent-traffic-geneator - Python CLI Tool to generate fake traffic against URLs with configurable user-agents

Code for the Higgs Boson Machine Learning Challenge organised by CERN & EPFL

D2Go is a toolkit for efficient deep learning

Improving Compound Activity Classification via Deep Transfer and Representation Learning

Generative Flow Networks for Discrete Probabilistic Modeling

Geometric Sensitivity Decomposition

Unofficial PyTorch Implementation of AHDRNet (CVPR 2019)

Official Implementation of SWAD (NeurIPS 2021)

Official implementation for the paper: Permutation Invariant Graph Generation via Score-Based Generative Modeling

Fast Differentiable Matrix Sqrt Root

[CVPR 2021] Official PyTorch Implementation for "Iterative Filter Adaptive Network for Single Image Defocus Deblurring"

QuALITY: Question Answering with Long Input Texts, Yes!

Bald-to-Hairy Translation Using CycleGAN

Code for the paper Progressive Pose Attention for Person Image Generation in CVPR19 (Oral).

Implementation of the paper "Language-agnostic representation learning of source code from structure and context".

This is a project based on ConvNets used to identify whether a road is clean or dirty. We have used MobileNet as our base architecture and the weights are based on imagenet.

Official PyTorch implementation of "Improving Face Recognition with Large AgeGaps by Learning to Distinguish Children" (BMVC 2021)