MixRNet(Using mixup as regularization and tuning hyper-parameters for ResNets)

Last update: Jan 16, 2022


Uses mixup data augmentation as regularization and tunes the hyperparameters of ResNet-50 models to achieve 94.57% test accuracy on the CIFAR-10 dataset. Link to paper

network	error %
resnet-50	6.97
resnet-110	6.61
resnet-164	5.93
resnet-1001	7.61
This method	5.43

Overview

Replace the wandb API key with a valid API key.
Python 3.8 and PyTorch 1.9 (older versions work as well).
main.py trains the model.
sweep.py and sweep_config.py run the hyperparameter optimization. wandb is used for experiment tracking, so change the API key there as well.
pred.py runs the trained model on custom data (provide the model paths appropriately).

Important

To run sweep.py you must provide a wandb API key. To run main.py, either use wandb to log the experiment for comparison or comment out the wandb part.

Training


# Start training with (the optional --run_name argument names the run for easier experiment tracking):

python main.py

  

# You can manually resume the training with:

python main.py --resume --lr=0.01
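
As a rough illustration of how the flags in the commands above could be parsed, here is a minimal argparse sketch. The flag names (--lr, --resume, --run_name) come from this README; the defaults, help strings, and the build_parser helper are assumptions and may not match what main.py actually does.

```python
import argparse


def build_parser():
    # Flag names are taken from the README; defaults and help text are assumed.
    parser = argparse.ArgumentParser(description="CIFAR-10 training (illustrative sketch)")
    parser.add_argument("--lr", type=float, default=0.1,
                        help="learning rate (default value is an assumption)")
    parser.add_argument("--resume", action="store_true",
                        help="resume training from a saved checkpoint")
    parser.add_argument("--run_name", type=str, default=None,
                        help="optional label for the tracked run")
    return parser


# Equivalent to: python main.py --resume --lr=0.01
args = build_parser().parse_args(["--resume", "--lr=0.01"])
print(args.resume, args.lr, args.run_name)
```

Because --resume is a store_true flag, it is False unless it appears on the command line, so a plain `python main.py` starts from scratch.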

Hyperparameters sweep


# Start sweep with:

python sweep.py

  

# Provide appropriate hyperparameter ranges in sweep_config.py. The config is written as a Python file so that the sweep settings can use the math package.
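
To show why a Python file is convenient here, the sketch below builds a wandb-style sweep configuration with the math package. The parameter names, ranges, and metric name are hypothetical and are not taken from the repository's sweep_config.py. It relies on wandb's log_uniform distribution, whose min and max are given as natural-log exponents.

```python
import math

# Hypothetical sweep configuration; names and ranges are illustrative only.
sweep_config = {
    "method": "bayes",
    "metric": {"name": "val_loss", "goal": "minimize"},
    "parameters": {
        # log_uniform samples exp(u) with u uniform in [min, max],
        # so the bounds are written as natural logarithms.
        "lr": {
            "distribution": "log_uniform",
            "min": math.log(1e-4),
            "max": math.log(1e-1),
        },
        # Powers of two from 32 to 256.
        "batch_size": {"values": [2 ** k for k in range(5, 9)]},
        # Assumed name for the mixup Beta-distribution parameter.
        "mixup_alpha": {"distribution": "uniform", "min": 0.1, "max": 1.0},
    },
}
```

A dictionary like this would typically be passed to wandb.sweep to register the sweep; computing the bounds with math.log keeps the intended learning-rate range readable in the source.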

Running on custom dataset


# Convert target data from shape (N*32*32*3) to (N*3*32*32) and pass it through the model (provide the paths of the saved models):

python pred.py
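
The shape conversion described above is an axis reordering, not a reshape. Below is a minimal NumPy sketch; the helper name to_channels_first and the placeholder batch are assumptions for illustration and are not part of pred.py.

```python
import numpy as np


def to_channels_first(images):
    """Reorder an (N, H, W, 3) image array to (N, 3, H, W)."""
    images = np.asarray(images)
    if images.ndim != 4 or images.shape[-1] != 3:
        raise ValueError(f"expected shape (N, H, W, 3), got {images.shape}")
    # Move the channel axis from last to second position, then copy
    # into contiguous memory so downstream code gets a compact array.
    return np.ascontiguousarray(images.transpose(0, 3, 1, 2))


batch = np.zeros((5, 32, 32, 3), dtype=np.uint8)  # placeholder data
print(to_channels_first(batch).shape)  # (5, 3, 32, 32)
```

Calling reshape(N, 3, 32, 32) instead would produce the right shape but scramble pixel values across channels, so transpose is the safer choice.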

Other files

mixup.py contains functions to calculate the loss on mixup predictions, since nn.CrossEntropyLoss cannot be used directly.
utils.py contains some helper functions.
dataloader.py is a torch class-based dataloader for the training data (CIFAR-10).
private_loader.py is a torch class-based dataloader for the private data.
Transformations are applied using torchvision transforms in main.py and sweep.py, depending on usage.
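
To make the mixup loss concrete, here is a framework-agnostic NumPy sketch of the standard mixup recipe: each batch is blended with a shuffled copy of itself, and the loss is the same convex combination of the cross-entropy against both label sets. The function names and the alpha default are assumptions; the repository's mixup.py is written for PyTorch and may differ in detail.

```python
import numpy as np


def mixup_data(x, y, alpha=1.0, rng=None):
    """Blend each sample with a randomly chosen partner from the same batch."""
    rng = np.random.default_rng() if rng is None else rng
    # lam ~ Beta(alpha, alpha); alpha <= 0 disables mixing.
    lam = float(rng.beta(alpha, alpha)) if alpha > 0 else 1.0
    perm = rng.permutation(x.shape[0])
    mixed_x = lam * x + (1.0 - lam) * x[perm]
    return mixed_x, y, y[perm], lam


def cross_entropy(logits, targets):
    """Mean cross-entropy from raw logits and integer class labels."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-log_probs[np.arange(len(targets)), targets].mean())


def mixup_loss(logits, y_a, y_b, lam):
    """Weight the loss against both label sets by the mixing coefficient."""
    return lam * cross_entropy(logits, y_a) + (1.0 - lam) * cross_entropy(logits, y_b)
```

In PyTorch the same combination is usually written as lam * criterion(pred, y_a) + (1 - lam) * criterion(pred, y_b), with criterion being an nn.CrossEntropyLoss instance, which is why a single plain call to the criterion is not enough.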


Owner

Bhanu
