Code associated with the paper "Towards Understanding the Data Dependency of Mixup-style Training".

Last update: Nov 11, 2021

Related tags

Overview

Mixup-Data-Dependency

Code associated with the paper "Towards Understanding the Data Dependency of Mixup-style Training".

Running Alternating Line Experiments

In order to generate the plots found in Section 2.3 ("A Mixup Failure Case"), one can run the following command for different values of alpha.

python3 tasks/train_models.py --task-name NCAL --alpha 128 --num-runs 10

If running using slurm, it is also possible to just run:

./tasks/run_task_with_erm.sh NCAL 128 10 0

The generated output files can be found under runs/ and plots/ with file names based on the provided parameters.

Running Image Classification Experiments

In order to generate the plots found in Section 2.4 ("Sufficient Conditions for Minimizing the Original Risk"), one can run the following commands for different values of alpha.

python3 tasks/train_models.py --task-name MNIST --alpha 1024 --num-runs 5
python3 tasks/train_models.py --task-name CIFAR10 --alpha 1024 --num-runs 5
python3 tasks/train_models.py --task-name CIFAR100 --alpha 1024 --num-runs 5

Once again, if running using slurm it is possible to instead run ./tasks/run_task_with_erm.sh with the same arguments as above and an additional fourth argument set to 0. As before, output files can be found in runs/ and plots/.

Running Angular Distance Analysis

To recreate the approximate epsilon computation found in Section 2.4 (in the discussion of application of sufficient conditions), one can run the following command after manually setting subset_prop and alpha in analysis/mixup_point_analysis.py.

python3 analysis/mixup_point_analysis.py

Running Two Moons Experiments

To recreate the two moons experiments found in Section 3.1 ("The Margin of Mixup Classifiers"), set alpha_1 and alpha_2 in tasks/two_moons/py to the mixing parameters to be compared and then run the following command.

python3 tasks/two_moons.py

Code associated with the paper "Towards Understanding the Data Dependency of Mixup-style Training".

Related tags

Overview

Mixup-Data-Dependency

Running Alternating Line Experiments

Running Image Classification Experiments

Running Angular Distance Analysis

Running Two Moons Experiments

Owner

Muthu Chidambaram

DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)

Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer

Official implementation of the network presented in the paper "M4Depth: A motion-based approach for monocular depth estimation on video sequences"

Recurrent Neural Network Tutorial, Part 2 - Implementing a RNN in Python and Theano

Using VideoBERT to tackle video prediction

This repository contains implementations and illustrative code to accompany DeepMind publications

PFFDTD is an open-source FDTD simulator for 3D room acoustics

Source code for "Interactive All-Hex Meshing via Cuboid Decomposition [SIGGRAPH Asia 2021]".

Learning To Have An Ear For Face Super-Resolution

[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.

Time Series Forecasting with Temporal Fusion Transformer in Pytorch

Code for the submitted paper Surrogate-based cross-correlation for particle image velocimetry

Real-world Anomaly Detection in Surveillance Videos- pytorch Re-implementation

GT4SD, an open-source library to accelerate hypothesis generation in the scientific discovery process.

KDD CUP 2020 Automatic Graph Representation Learning: 1st Place Solution

Code implementation of Data Efficient Stagewise Knowledge Distillation paper.

The code for paper "Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation" which is accepted by AAAI 2022

Allows including an action inside another action (by preprocessing the Yaml file). This is how composite actions should have worked.

This repo is to present various code demos on how to use our Graph4NLP library.

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)