Code release for "Self-Tuning for Data-Efficient Deep Learning" (ICML 2021)

Last update: Dec 11, 2022

Related tags

Overview

Self-Tuning for Data-Efficient Deep Learning

This repository contains the implementation code for paper:
Self-Tuning for Data-Efficient Deep Learning
Ximei Wang, Jinghan Gao, Mingsheng Long, Jianmin Wang
38th International Conference on Machine Learning (ICML 2021)
[Project Page] [Paper] [Video] [Slide] [Poster] [Blog] [Zhihu] [SlidesLive]

Brief Introduction for Data-Efficient Deep Learning

Mitigating the requirement for labeled data is a vital issue in deep learning community. However, common practices of TL and SSL only focus on either the pre-trained model or unlabeled data. This paper unleashes the power of both worlds by proposing a new setup named data-efficient deep learning, aims to mitigate the requirement of labeled data by unifying the exploration of labeled and unlabeled data and the transfer of pre-trained model.

To address the challenge of confirmation bias in self-training, a general Pseudo Group Contrast mechanism is devised to mitigate the reliance on pseudo-labels and boost the tolerance to false labels. To tackle the model shift problem, we unify the exploration of labeled and unlabeled data and the transfer of a pre-trained model, with a shared key queue beyond just 'parallel training'. Comprehensive experiments demonstrate that Self-Tuning outperforms its SSL and TL counterparts on five tasks by sharp margins, e.g., it doubles the accuracy of fine-tuning on Stanford-Cars provided with 15% labels.

Dependencies

python3.6
torch == 1.3.1 (with suitable CUDA and CuDNN version)
torchvision == 0.4.2
tensorboardX
numpy
argparse

Datasets

Dataset	Download Link
CUB-200-2011	http://www.vision.caltech.edu/visipedia/CUB-200-2011.html
Stanford Cars	http://ai.stanford.edu/~jkrause/cars/car_dataset.html
FGVC Aircraft	http://www.robots.ox.ac.uk/~vgg/data/fgvc-aircraft/
Cifar100	https://www.cs.toronto.edu/~kriz/cifar.html

You can either download datasets via the above links or directly run the commands shown below to automatically download datasets as well as data lists from Tsinghua Cloud.

Disclaimer on Datasets

This open-sourced code will download and prepare public datasets. We do not host or distribute these datasets, vouch for their quality or fairness, or claim that you have licenses to use the dataset. It is your responsibility to determine whether you have permission to use the dataset under the dataset's license.

If you're a dataset owner and wish to update any part of it (description, citation, etc.), or do not want your dataset to be included in this code, please get in touch with us through a GitHub issue. Thanks for your contribution to the ML community!

Quick Start

The running commands for several datasets are shown below. Please refer to run.sh for commands for datasets with other label ratios.

python src/main.py  --root ./StanfordCars --batch_size 24 --logdir vis/ --gpu_id 0 --queue_size 32 --projector_dim 1024 --backbone resnet50  --label_ratio 15 --pretrained
python src/main.py  --root ./CUB200 --batch_size 24 --logdir vis/ --gpu_id 1 --queue_size 32 --projector_dim 1024 --backbone resnet50 --label_ratio 15 --pretrained
python src/main.py  --root ./Aircraft --batch_size 24 --logdir vis/ --gpu_id 2 --queue_size 32 --projector_dim 1024 --backbone resnet50 --label_ratio 15 --pretrained
python src/main.py  --root ./cifar100 --batch_size 20 --logdir vis/ --gpu_id 3 --queue_size 32 --backbone efficientnet-b2 --num_labeled 10000 --expand_label --pretrained --projector_dim 1024

Tensorboard Log

Dataset	Label Ratio 1	Label Ratio 2	Label Ratio 3
CUB-200-2011	15%	30%	50%
Stanford Cars	15%	30%	50%
FGVC Aircraft	15%	30%	50%
Cifar100	400	2500	10000

We achieved better results than that reported in the paper, after fixing some small bugs of the code.

Updates

[07/2021] We have created a Blog post in Chinese for this work. Check it out for more details!
[07/2021] We have released the code and models. You can find all reproduced checkpoints via this link.
[06/2021] A five minute video is released to briefly introduce the main idea of Self-Tuning.
[05/2021] Paper accepted to ICML 2021 as a Short Talk.
[02/2021] arXiv version posted. Please stay tuned for updates.

Citation

If you find this code or idea useful, please cite our work:

@inproceedings{wang2021selftuning,
  title={Self-Tuning for Data-Efficient Deep Learning},
  author={Wang, Ximei and Gao, Jinghan and Long, Mingsheng and Wang, Jianmin},
  booktitle={International Conference on Machine Learning (ICML)},
  year={2021}
}

Contact

If you have any questions, feel free to contact us through email ([email protected]) or Github issues. Enjoy!

Code release for "Self-Tuning for Data-Efficient Deep Learning" (ICML 2021)

Related tags

Overview

Self-Tuning for Data-Efficient Deep Learning

Brief Introduction for Data-Efficient Deep Learning

Dependencies

Datasets

Disclaimer on Datasets

Quick Start

Tensorboard Log

Updates

Citation

Contact

Owner

THUML @ Tsinghua University

Facilitating Database Tuning with Hyper-ParameterOptimization: A Comprehensive Experimental Evaluation

The backbone CSPDarkNet of YOLOX.

HGCAE Pytorch implementation. CVPR2021 accepted.

Extension to fastai for volumetric medical data

This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020

FPSAutomaticAiming——基于YOLOV5的FPS类游戏自动瞄准AI

YOLOX_AUDIO is an audio event detection model based on YOLOX

Code for the paper "A Study of Face Obfuscation in ImageNet"

This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes dataset. This code is implemented in Pytorch.

[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games

The official implementation of paper Siamese Transformer Pyramid Networks for Real-Time UAV Tracking, accepted by WACV22

RobustVideoMatting and background composing in one model by using onnxruntime.

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Official codebase used to develop Vision Transformer, MLP-Mixer, LiT and more.

Official Implementation of SWAGAN: A Style-based Wavelet-driven Generative Model

Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation

It's like Shape Editor in Maya but works with skeletons (transforms).

Codebase of deep learning models for inferring stability of mRNA molecules

Deep Learning as a Cloud API Service.

A Closer Look at Structured Pruning for Neural Network Compression