Release for Improved Denoising Diffusion Probabilistic Models

Last update: Dec 30, 2022

Overview

improved-diffusion

This is the codebase for Improved Denoising Diffusion Probabilistic Models.

Usage

This section of the README walks through how to train and sample from a model.

Installation

Clone this repository and navigate to it in your terminal. Then run:

pip install -e .

This should install the improved_diffusion Python package that the scripts depend on.

Preparing Data

The training code reads images from a directory of image files. In the datasets folder, we have provided instructions/scripts for preparing these directories for ImageNet, LSUN bedrooms, and CIFAR-10.

To create your own dataset, place all of your images in a directory, using ".jpg", ".jpeg", or ".png" extensions. If you wish to train a class-conditional model, name the files like "mylabel1_XXX.jpg", "mylabel2_YYY.jpg", etc., so that the data loader knows that "mylabel1" and "mylabel2" are the labels. Subdirectories are enumerated automatically as well, so the images can be organized into a nested structure. Directory names are ignored; only the prefix before the underscore in each file name is used as the label.
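The naming convention above can be checked before training with a short script. This is a minimal sketch, not the repository's data loader: the helper names (`list_image_files`, `label_from_path`, `class_index_map`) are hypothetical, and matching extensions case-insensitively is an assumption.

```python
import os

# Extensions named in this README; case-insensitive matching is an assumption.
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png")


def list_image_files(data_dir):
    """Recursively collect image paths under data_dir, sorted for determinism."""
    paths = []
    for root, _dirs, files in os.walk(data_dir):
        for name in files:
            if name.lower().endswith(IMAGE_EXTENSIONS):
                paths.append(os.path.join(root, name))
    return sorted(paths)


def label_from_path(path):
    """Use the text before the first underscore of the file name as the label."""
    return os.path.basename(path).split("_")[0]


def class_index_map(paths):
    """Assign an integer index to each distinct label, in sorted order."""
    labels = sorted({label_from_path(p) for p in paths})
    return {label: index for index, label in enumerate(labels)}
```

Running `class_index_map(list_image_files("path/to/images"))` on a dataset directory gives a quick view of which labels the file names would produce.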

The images will automatically be scaled and center-cropped by the data-loading pipeline. Simply pass --data_dir path/to/images to the training script, and it will take care of the rest.
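To make "scaled and center-cropped" concrete, the geometry can be sketched as follows. This is an illustration under assumptions: the function name is hypothetical, and the README does not describe the pipeline's exact rounding or resampling filter, so the real loader may differ in detail.

```python
def scale_and_center_crop_box(width, height, image_size):
    """Return ((new_w, new_h), (left, top, right, bottom)) for a square crop.

    The image is first scaled so that its shorter side equals image_size,
    then a centered image_size x image_size window is taken.
    """
    scale = image_size / min(width, height)
    # Guard against rounding pushing either side below image_size.
    new_w = max(image_size, round(width * scale))
    new_h = max(image_size, round(height * scale))
    left = (new_w - image_size) // 2
    top = (new_h - image_size) // 2
    return (new_w, new_h), (left, top, left + image_size, top + image_size)
```

For a 640x480 input and `image_size=64`, the image is scaled to 85x64 and the crop window is `(10, 0, 74, 64)`.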

Training

To train your model, you should first decide some hyperparameters. We will split up our hyperparameters into three groups: model architecture, diffusion process, and training flags. Here are some reasonable defaults for a baseline:

MODEL_FLAGS="--image_size 64 --num_channels 128 --num_res_blocks 3"
DIFFUSION_FLAGS="--diffusion_steps 4000 --noise_schedule linear"
TRAIN_FLAGS="--lr 1e-4 --batch_size 128"

Here are some changes we experiment with, and how to set them in the flags:

Learned sigmas: add --learn_sigma True to MODEL_FLAGS.
Cosine schedule: change --noise_schedule linear to --noise_schedule cosine.
Reweighted VLB: add --use_kl True to DIFFUSION_FLAGS and add --schedule_sampler loss-second-moment to TRAIN_FLAGS.
Class-conditional: add --class_cond True to MODEL_FLAGS.
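The baseline flags and the variations listed above can be assembled programmatically. The sketch below is a hypothetical helper (`build_flags` is not part of the repository); it only uses flags and values that appear in this README.

```python
def build_flags(learn_sigma=False, cosine=False, reweighted_vlb=False, class_cond=False):
    """Return (MODEL_FLAGS, DIFFUSION_FLAGS, TRAIN_FLAGS) as strings."""
    model = ["--image_size", "64", "--num_channels", "128", "--num_res_blocks", "3"]
    diffusion = [
        "--diffusion_steps", "4000",
        "--noise_schedule", "cosine" if cosine else "linear",
    ]
    train = ["--lr", "1e-4", "--batch_size", "128"]

    if learn_sigma:
        model += ["--learn_sigma", "True"]
    if class_cond:
        model += ["--class_cond", "True"]
    if reweighted_vlb:
        # The reweighted VLB touches two groups: the loss and the sampler.
        diffusion += ["--use_kl", "True"]
        train += ["--schedule_sampler", "loss-second-moment"]

    return " ".join(model), " ".join(diffusion), " ".join(train)
```

Calling `build_flags()` with no arguments reproduces the baseline strings shown earlier.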

Once you have set up your hyperparameters, you can run an experiment like so:

python scripts/image_train.py --data_dir path/to/images $MODEL_FLAGS $DIFFUSION_FLAGS $TRAIN_FLAGS

You may also want to train in a distributed manner. In this case, run the same command with mpiexec:

mpiexec -n $NUM_GPUS python scripts/image_train.py --data_dir path/to/images $MODEL_FLAGS $DIFFUSION_FLAGS $TRAIN_FLAGS

When training in a distributed manner, you must manually divide the --batch_size argument by the number of ranks. In lieu of distributed training, you may use --microbatch 16 (or --microbatch 1 in extreme memory-limited cases) to reduce memory usage.
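The batch-size arithmetic can be sketched as below. Both helper names are hypothetical, and the sketch assumes that --microbatch splits each batch into sequential chunks of at most that size; the README itself only states that the option reduces memory usage.

```python
def per_rank_batch_size(global_batch_size, num_ranks):
    """Value to pass as --batch_size when launching num_ranks processes."""
    if global_batch_size % num_ranks != 0:
        raise ValueError("global batch size must be divisible by the number of ranks")
    return global_batch_size // num_ranks


def microbatch_sizes(batch_size, microbatch):
    """Chunk sizes if a batch is processed in pieces of at most `microbatch`."""
    return [
        min(microbatch, batch_size - start)
        for start in range(0, batch_size, microbatch)
    ]
```

For example, keeping an overall batch of 128 across 4 ranks means passing `--batch_size 32`, and `--microbatch 16` on a single process would split a batch of 128 into 8 chunks under the assumption above.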

The logs and saved models will be written to a logging directory determined by the OPENAI_LOGDIR environment variable. If it is not set, then a temporary directory will be created in /tmp.
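The lookup described above can be sketched as follows. This is not the repository's logger: `resolve_log_dir` is a hypothetical helper, and the "openai-" prefix for the temporary directory is an illustrative assumption.

```python
import os
import tempfile


def resolve_log_dir(environ=None):
    """Return OPENAI_LOGDIR if set, otherwise a fresh temporary directory."""
    environ = os.environ if environ is None else environ
    log_dir = environ.get("OPENAI_LOGDIR")
    if log_dir:
        os.makedirs(log_dir, exist_ok=True)
        return log_dir
    # Created under the system temp location, typically /tmp on Linux.
    # The "openai-" prefix is an assumption made for this sketch.
    return tempfile.mkdtemp(prefix="openai-")
```

Setting the variable explicitly (for example `OPENAI_LOGDIR=/path/to/logs`) before launching training makes checkpoints easier to find afterwards.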

Sampling

The above training script saves checkpoints to .pt files in the logging directory. These checkpoints will have names like ema_0.9999_200000.pt and model200000.pt. You will likely want to sample from the EMA models, since those produce much better samples.
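To pick the most recent EMA checkpoint from a logging directory, a small helper along these lines may be useful. It is a sketch that assumes file names follow the ema_RATE_STEP.pt pattern shown above; `latest_ema_checkpoint` is a hypothetical name.

```python
import os
import re

# Matches names such as ema_0.9999_200000.pt (pattern inferred from the example above).
EMA_PATTERN = re.compile(r"^ema_(?P<rate>[0-9.]+)_(?P<step>\d+)\.pt$")


def latest_ema_checkpoint(log_dir):
    """Return the path of the EMA checkpoint with the highest step, or None."""
    best = None
    for name in os.listdir(log_dir):
        match = EMA_PATTERN.match(name)
        if match is None:
            continue  # skip non-EMA files such as model200000.pt
        step = int(match.group("step"))
        if best is None or step > best[0]:
            best = (step, os.path.join(log_dir, name))
    return None if best is None else best[1]
```

The returned path can be passed as --model_path to the sampling script.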

Once you have a path to your model, you can generate a large batch of samples like so:

python scripts/image_sample.py --model_path /path/to/model.pt $MODEL_FLAGS $DIFFUSION_FLAGS

Again, this will save results to a logging directory. Samples are saved as a single large .npz file, in which arr_0 holds the batch of samples.
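The saved file can be read back with NumPy. This is a minimal sketch: `load_samples` is a hypothetical helper, and the array's shape and dtype are not stated in this README, so inspect them before relying on a particular layout.

```python
import numpy as np


def load_samples(npz_path):
    """Load the sample batch stored under the arr_0 key of an .npz file."""
    with np.load(npz_path) as data:
        return data["arr_0"]
```

After loading, `samples.shape` and `samples.dtype` show how the batch is laid out, for example before writing individual images to disk.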

Just like for training, you can run image_sample.py through MPI to use multiple GPUs and machines.

You can change the number of sampling steps using the --timestep_respacing argument. For example, --timestep_respacing 250 uses 250 steps to sample. Passing --timestep_respacing ddim250 is similar, but uses the uniform stride from the DDIM paper rather than our stride.
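The difference between the two options can be illustrated with a simplified sketch. Both functions are hypothetical and are not the repository's implementation: the first assumes the default behavior spreads steps evenly from the first to the last timestep using a fractional stride, and the second assumes the DDIM-style option uses a fixed integer stride.

```python
def even_steps(num_timesteps, desired_count):
    """Spread desired_count steps evenly over [0, num_timesteps - 1]."""
    if not 1 <= desired_count <= num_timesteps:
        raise ValueError("desired_count must be between 1 and num_timesteps")
    if desired_count == 1:
        return [0]
    frac_stride = (num_timesteps - 1) / (desired_count - 1)
    return [round(i * frac_stride) for i in range(desired_count)]


def fixed_stride_steps(num_timesteps, desired_count):
    """Pick steps 0, s, 2s, ... for the integer stride s that yields desired_count steps."""
    for stride in range(1, num_timesteps):
        steps = list(range(0, num_timesteps, stride))
        if len(steps) == desired_count:
            return steps
    raise ValueError("no integer stride gives exactly desired_count steps")
```

With 4000 diffusion steps and 250 sampling steps, the fixed stride is 16 and the last retained step is 3984, whereas the even spacing in this sketch reaches the final timestep, 3999.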

To sample using DDIM, pass --use_ddim True.
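The sampling options above can be combined into a single command line. The helper below is a hypothetical sketch that only uses the script path and flags named in this README; whether a given respacing value is best paired with --use_ddim True is left to the reader.

```python
import shlex


def sample_command(model_path, model_flags, diffusion_flags,
                   timestep_respacing=None, use_ddim=False):
    """Build the argument list for scripts/image_sample.py."""
    cmd = ["python", "scripts/image_sample.py", "--model_path", model_path]
    # The flag strings are split the same way a shell would split them.
    cmd += shlex.split(model_flags) + shlex.split(diffusion_flags)
    if timestep_respacing is not None:
        cmd += ["--timestep_respacing", str(timestep_respacing)]
    if use_ddim:
        cmd += ["--use_ddim", "True"]
    return cmd
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` from the repository root.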

Owner

OpenAI
