PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Last update: Dec 21, 2022

Related tags

Deep Learning directclr

Overview

DirectCLR

DirectCLR is a simple contrastive learning model for visual representation learning. It does not require a trainable projector as SimCLR. It is able to prevent dimensional collapse and outperform SimCLR with a linear projector.

PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning.

@article{Jing2021UnderstandingDC,
  title={Understanding Dimensional Collapse in Contrastive Self-supervised Learning},
  author={Li Jing and Pascal Vincent and Yann LeCun and Yuandong Tian},
  journal={arXiv preprint arXiv:2110.09348},
  year={2021}
}

DirectCLR Training

Install PyTorch and download ImageNet by following the instructions in the requirements section of the PyTorch ImageNet training example. The code has been developed for PyTorch version 1.7.1 and torchvision version 0.8.2, but it should work with other versions just as well.

Our best model is obtained by running the following command:

python main.py --data /path/to/imagenet/ --mode directclr --dim 360

Mode can be chosen as:

simclr: standard SimCLR with two layer nonlinear projector;

single: SimCLR with single layer linear projector;

baseline: SimCLR without a projector;

directclr: DirectCLR with single layer linear projector;

Training time is approximately 7 hours on 32 v100 GPUs.

Evaluation: Linear Classification

Train a linear probe on the representations. Freeze the weights of the resnet and use the entire ImageNet training set.

python linear_probe.py /path/to/imagenet/ /path/to/checkpoint/resnet50.pth

Linear probe time is approximately 20 hours on 8 v100 GPUs.

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Related tags

Overview

DirectCLR

DirectCLR Training

Evaluation: Linear Classification

License

Owner

Meta Research

A general, feasible, and extensible framework for classification tasks.

A model that attempts to learn and benefit from data collected on card counting.

This project demonstrates the use of neural networks and computer vision to create a classifier that interprets the Brazilian Sign Language.

Python inverse kinematics for your robot model based on Pinocchio.

YOLOX Win10 Project

Layered Neural Atlases for Consistent Video Editing

Official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family Distributions

A numpy-based implementation of RANSAC for fundamental matrix and homography estimation. The degeneracy updating and local optimization components are included and optional.

Face Recognition plus identification simply and fast | Python

The official implementation of Variable-Length Piano Infilling (VLI).

Mercury: easily convert Python notebook to web app and share with others

Machine learning framework for both deep learning and traditional algorithms

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Simple, but essential Bayesian optimization package

This repository contains all the code and materials distributed in the 2021 Q-Programming Summer of Qode.

Churn prediction

A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)

A SAT-based sudoku solver