Code accompanying the paper "Wasserstein GAN"

Last update: Jan 01, 2023

Related tags

Deep Learning WassersteinGAN

Overview

Wasserstein GAN

Code accompanying the paper "Wasserstein GAN"

A few notes

The first time running on the LSUN dataset it can take a long time (up to an hour) to create the dataloader. After the first run a small cache file will be created and the process should take a matter of seconds. The cache is a list of indices in the lmdb database (of LSUN)
The only addition to the code (that we forgot, and will add, on the paper) are the lines 163-166 of main.py. These lines act only on the first 25 generator iterations or very sporadically (once every 500 generator iterations). In such a case, they set the number of iterations on the critic to 100 instead of the default 5. This helps to start with the critic at optimum even in the first iterations. There shouldn't be a major difference in performance, but it can help, especially when visualizing learning curves (since otherwise you'd see the loss going up until the critic is properly trained). This is also why the first 25 iterations take significantly longer than the rest of the training as well.
If your learning curve suddenly takes a big drop take a look at this. It's a problem when the critic fails to be close to optimum, and hence its error stops being a good Wasserstein estimate. Known causes are high learning rates and momentum, and anything that helps the critic get back on track is likely to help with the issue.

Prerequisites

Computer with Linux or OSX
PyTorch
For training, an NVIDIA GPU is strongly recommended for speed. CPU is supported but training is very slow.

Two main empirical claims:

Generator sample quality correlates with discriminator loss

Improved model stability

Reproducing LSUN experiments

With DCGAN:

python main.py --dataset lsun --dataroot [lsun-train-folder] --cuda

With MLP:

python main.py --mlp_G --ngf 512

Generated samples will be in the samples folder.

If you plot the value -Loss_D, then you can reproduce the curves from the paper. The curves from the paper (as mentioned in the paper) have a median filter applied to them:

med_filtered_loss = scipy.signal.medfilt(-Loss_D, dtype='float64'), 101)

Code accompanying the paper "Wasserstein GAN"

Related tags

Overview

Wasserstein GAN

A few notes

Prerequisites

Generator sample quality correlates with discriminator loss

Improved model stability

Reproducing LSUN experiments

Owner

Deep Learning for Natural Language Processing SS 2021 (TU Darmstadt)

Official repository for "Restormer: Efficient Transformer for High-Resolution Image Restoration". SOTA results for single-image motion deblurring, image deraining, image denoising (synthetic and real data), and dual-pixel defocus deblurring.

Detectron2-FC a fast construction platform of neural network algorithm based on detectron2

Optical Character Recognition + Instance Segmentation for russian and english languages

A tensorflow implementation of an HMM layer

A multi-entity Transformer for multi-agent spatiotemporal modeling.

Implementation of the paper "Shapley Explanation Networks"

Sync2Gen Code for ICCV 2021 paper: Scene Synthesis via Uncertainty-Driven Attribute Synchronization

Anomaly detection in multi-agent trajectories: Code for training, evaluation and the OpenAI highway simulation.

Codebase for the solution that won first place and was awarded the most human-like agent in the 2021 NeurIPS Competition MineRL BASALT Challenge.

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection. CVPR 2022

Tensorflow port of a full NetVLAD network

Python implementation of Wu et al (2018)'s registration fusion

An official PyTorch implementation of the TKDE paper "Self-Supervised Graph Representation Learning via Topology Transformations".

ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers

Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)

MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

R3Det based on mmdet 2.19.0

This is the repo of the manuscript "Dual-branch Attention-In-Attention Transformer for speech enhancement"