Codes for CyGen, the novel generative modeling framework proposed in "On the Generative Utility of Cyclic Conditionals" (NeurIPS-21)

Last update: Nov 16, 2022

Overview

On the Generative Utility of Cyclic Conditionals

This repository is the official implementation of "On the Generative Utility of Cyclic Conditionals" (NeurIPS 2021).

Chang Liu <[email protected]>, Haoyue Tang, Tao Qin, Jintao Wang, Tie-Yan Liu.
[Paper & Appendix] [Slides] [Video] [Poster]

Introduction

Whether and how can two conditional models p(x|z) and q(z|x) that form a cycle uniquely determine a joint distribution p(x,z)? We develop a general theory for this question, including criteria for the two conditionals to correspond to a common joint (compatibility) and for such joint to be unique (determinacy). As in generative models we need a generator (decoder/likelihood model) and also an encoder (inference model) for representation, the theory indicates they could already define a generative model p(x,z) without specifying a prior distribution p(z)! We call this novel generative modeling framework as CyGen, and develop methods to achieve the eligibility (compatibility and determinacy) and the usage (fitting and generating data) as a generative model.

This codebase implements these CyGen methods, and various baseline methods. The model architectures are based on the Sylvester flow (Householder version), and the experiment environments/setups follow FFJORD. Authorship is clarified in each file.

Requirements

The code requires python version >= 3.6, and is based on PyTorch. To install requirements:

pip install -r requirements.txt

Usage

Run the run_toy.sh and run_image.sh scripts for the synthetic and real-world (i.e. MNIST and SVHN) experiments. See the commands in the script files or python3 main_[toy|image].py --help for customized usage or hyperparameter tuning.

For the real-world experiments, downstream classification accuracy is evaluated along training. To evaluate the FID score, run the command python3 compute_gen_fid.py --load_dict=<path_to_model.pth>.

Results

As a trailer, we show the synthetic results here. We see that CyGen achieves both high-quality data generation, and well-separated latent clusters (useful representation). This is due to the removal of a specified prior distribution so that the manifold mismatch and posterior collapse problems are avoided. DAE (denoising auto-encoder) does not need a prior, but its training method hurts determinacy. If pretrained as a VAE (i.e. CyGen(PT)), we see that the knowledge of a centered and centrosymmetric prior is encoded through the conditional models. See the paper for more results.

Codes for CyGen, the novel generative modeling framework proposed in "On the Generative Utility of Cyclic Conditionals" (NeurIPS-21)

Related tags

Overview

On the Generative Utility of Cyclic Conditionals

Introduction

Requirements

Usage

Results

Owner

Chang Liu

Official pytorch implementation of the AAAI 2021 paper Semantic Grouping Network for Video Captioning

[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games

Implementation of "Scaled-YOLOv4: Scaling Cross Stage Partial Network" using PyTorch framwork.

A coin flip game in which you can put the amount of money below or equal to 1000 and then choose heads or tail

Explaining Hyperparameter Optimization via PDPs

Official implementation of "SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers"

The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Text-to-SQL"

Voice Gender Recognition

Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

A Simple Long-Tailed Rocognition Baseline via Vision-Language Model

A Dataset for Direct Quotation Extraction and Attribution in News Articles.

Optimizes image files by converting them to webp while also updating all references.

Parameter Efficient Deep Probabilistic Forecasting

Implement object segmentation on images using HOG algorithm proposed in CVPR 2005

Final report with code for KAIST Course KSE 801.

Segmentation models with pretrained backbones. PyTorch.

Experiments for distributed optimization algorithms

This is the pytorch re-implementation of the IterNorm

Deep Face Recognition in PyTorch