Unsupervised Image Generation with Infinite Generative Adversarial Networks

This is an implementation of MICGANs using the DCGAN architecture on the MNIST dataset, written with Jittor.

Install Requirements

jittor
packages in requirements.txt

Training

You can train the model by using the following command:

bash Scripts/mnist.sh

This training run uses the configuration in './configs/default.yaml'.

Visualization

During training, to make the CRP sampling procedure easier to understand, we visualize how the real images are classified at each state of the procedure. The visualization results are saved in 'output/mnist/crp/mode_label', 'output/mnist/crp/sorted_mode_label' and 'output/mnist/crp/label_mode'.

The images in 'mode_label' look like the following:

In the image, the x-axis is the mode id, the y-axis is the number of real images classified to the mode, and the color represents the ground-truth label. Images in 'sorted_mode_label' show the sorted results. For images in 'label_mode', the x-axis is the ground-truth label, and the color represents the mode id.
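
The counts behind such a plot can be sketched as follows. This is a minimal illustration, not the code used in this repository: the function names and the toy data are hypothetical, and sorting modes by size is only an assumption about what 'sorted_mode_label' shows.

```python
from collections import Counter


def mode_label_counts(modes, labels):
    """Count, for each mode id, how many real images of each ground-truth label it holds."""
    table = {}
    for mode, label in zip(modes, labels):
        table.setdefault(mode, Counter())[label] += 1
    return table


def sort_modes_by_size(table):
    """Return mode ids ordered from largest to smallest (assumed sorting criterion)."""
    return sorted(table, key=lambda m: sum(table[m].values()), reverse=True)


# Toy data (hypothetical): the mode assigned to each real image and its true label.
modes = [0, 0, 1, 1, 1, 2]
labels = [3, 3, 7, 7, 3, 1]

table = mode_label_counts(modes, labels)
for mode in sort_modes_by_size(table):
    # Each entry corresponds to one stacked bar: bar height per label color.
    print(mode, dict(table[mode]))
```

Swapping the roles of `modes` and `labels` in the call gives the counts for a 'label_mode' style plot, where each bar is a ground-truth label and the colors are mode ids.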

Notes

The CRP sampling procedure is, at its core, Markov Chain Monte Carlo (MCMC) sampling. Besides plotting the likelihood, another good way to check whether the chain has mixed is to plot the clustering results, as in the image above. Because it is MCMC, different samples can give slightly different results even after the chain has mixed.
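
For readers unfamiliar with the CRP, the sketch below draws assignments from the standard Chinese Restaurant Process prior alone: an item joins an existing mode with probability proportional to that mode's size, or opens a new mode with probability proportional to a concentration parameter. It is a generic illustration, not the sampler in this repository, which would also need to take the data likelihood into account; the names `crp_probabilities`, `crp_sample` and `alpha` are hypothetical.

```python
import random


def crp_probabilities(counts, alpha):
    """Prior probability of joining each existing mode; the last entry is a new mode."""
    total = sum(counts) + alpha
    return [c / total for c in counts] + [alpha / total]


def crp_sample(num_items, alpha, seed=0):
    """Sequentially assign items to modes under the CRP prior (alpha must be > 0)."""
    rng = random.Random(seed)
    counts = []        # counts[k] = number of items currently in mode k
    assignments = []   # assignments[i] = mode id of item i
    for _ in range(num_items):
        probs = crp_probabilities(counts, alpha)
        k = rng.choices(range(len(probs)), weights=probs)[0]
        if k == len(counts):
            counts.append(1)   # open a new mode
        else:
            counts[k] += 1     # join an existing mode
        assignments.append(k)
    return assignments, counts


assignments, counts = crp_sample(200, alpha=1.0, seed=0)
print(len(counts), "modes with sizes", counts)
```

Running it with different seeds gives different partitions, which illustrates why individual samples can differ slightly even after mixing.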

Citations

Please consider citing our paper in your publications if this project helps your research. The BibTeX reference is as follows.

@inproceedings{ying2021unsupervised,
  title={Unsupervised Image Generation with Infinite Generative Adversarial Networks},
  author={Ying, Hui and Wang, He and Shao, Tianjia and Yang, Yin and Zhou, Kun},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={14284--14293},
  year={2021}
}
