We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

Last update: Sep 08, 2022

Related tags

Overview

This repository provides the implementation for the paper "Improved Regularization and Robustness for Fine-tuning in Neural Networks", which will be presented as a poster paper in NeurIPS'21.

In this work, we propose a regularized self-labeling approach that combines regularization and self-training methods for improving the generalization and robustness properties of fine-tuning. Our approach includes two components:

First, we encode layer-wise regularization to penalize the model weights at different layers of the neural net.
Second, we add self-labeling that relabels data points based on current neural net's belief and reweights data points whose confidence is low.

Requirements

Python >= 3.6
PyTorch >= 1.7
Optuna >= 2.5
Numpy

Usage

Our algorithm is based on layer-wise regularization and self label-correction and label-weighting.

As an example, here are the test accuracy results on the Indoor dataset with independent label noise:

Method	Noise = 20%	Noise = 40%	Noise = 60%	Noise = 80%
Ours	75.21 $\pm$ 0.46	68.13 $\pm$ 0.16	57.59 $\pm$ 0.55	34.08 $\pm$ 0.79
Fine-tuning	65.02 $\pm$ 0.39	57.49 $\pm$ 0.39	44.60 $\pm$ 0.95	27.09 $\pm$ 0.19

Run following code to replicate above results:

python train_label_noise.py --config configs/config_constraint_indoor.json --model ResNet18 \
    --reg_method constraint --reg_norm frob \
    --reg_extractor 7.80246991703043 --reg_predictor 14.077402847906 \
    --noise_rate 0.2 --train_correct_label --reweight_epoch 5 --reweight_temp 2.0 --correct_epoch 10 --correct_thres 0.9 

python train_label_noise.py --config configs/config_constraint_indoor.json --model ResNet18 \
    --reg_method constraint --reg_norm frob \
    --reg_extractor 8.47139398080791 --reg_predictor 19.0191127114923 \
    --noise_rate 0.4 --train_correct_label --reweight_epoch 5 --reweight_temp 2.0 --correct_epoch 10 --correct_thres 0.9 

python train_label_noise.py --config configs/config_constraint_indoor.json --model ResNet18 \
    --reg_method constraint --reg_norm frob \
    --reg_extractor 10.7576018531961 --reg_predictor 19.8157649727473 \
    --noise_rate 0.6 --train_correct_label --reweight_epoch 5 --reweight_temp 2.0 --correct_epoch 10 --correct_thres 0.9 
    
python train_label_noise.py --config configs/config_constraint_indoor.json --model ResNet18 \
    --reg_method constraint --reg_norm frob \
    --reg_extractor 9.2031662757248 --reg_predictor 6.41568500472423 \
    --noise_rate 0.8 --train_correct_label --reweight_epoch 5 --reweight_temp 1.5 --correct_epoch 10 --correct_thres 0.9

Data Preparation

We use seven image datasets in our paper. We list the link for downloading these datasets and describe how to prepare data to run our code below.

Aircrafts: download and extract into ./data/aircrafts
- remove the class 257.clutter out of the data directory
CUB-200-2011: download and extract into ./data/CUB_200_2011/
Caltech-256: download and extract into ./data/caltech256/
Stanford-Cars: download and extract into ./data/StanfordCars/
Stanford-Dogs: download and extract into ./data/StanfordDogs/
Flowers: download and extract into ./data/flowers/
MIT-Indoor: download and extract into ./data/Indoor/

Our code automatically handles the split of the datasets.

Citation

If you find this repository useful, consider citing our work titled above.

Acknowledgment

Thanks to the authors of mars-finetuning and WS-DAN.PyTorch for providing their implementation publicly available.

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

Related tags

Overview

Overview

Requirements

Usage

Data Preparation

Citation

Acknowledgment

Owner

NEU-StatsML-Research

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.

以孤立语假设和宽度优先搜索为基础，构建了一种多通道堆叠注意力Transformer结构的斗地主ai

A scikit-learn-compatible module for estimating prediction intervals.

Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers

PlenOctree Extraction algorithm

GAN JAX - A toy project to generate images from GANs with JAX

The Medical Detection Toolkit contains 2D + 3D implementations of prevalent object detectors such as Mask R-CNN, Retina Net, Retina U-Net, as well as a training and inference framework focused on dealing with medical images.

基于YoloX目标检测+DeepSort算法实现多目标追踪Baseline

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

NPBG++: Accelerating Neural Point-Based Graphics

i3DMM: Deep Implicit 3D Morphable Model of Human Heads

Code for the paper "Improving Vision-and-Language Navigation with Image-Text Pairs from the Web" (ECCV 2020)

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

A quick recipe to learn all about Transformers

DLL: Direct Lidar Localization

Instance-conditional Knowledge Distillation for Object Detection

Contains modeling practice materials and homework for the Computational Neuroscience course at Okinawa Institute of Science and Technology

Parsing, analyzing, and comparing source code across many languages

RL and distillation in CARLA using a factorized world model