Learning What and Where to Draw

Last update: Nov 18, 2022


###Learning What and Where to Draw

Scott Reed, Zeynep Akata, Santosh Mohan, Samuel Tenka, Bernt Schiele, Honglak Lee

This is the code for our NIPS 2016 paper on text- and location-controllable image synthesis using conditional GANs. Much of the code is adapted from reedscot/icml2016 and dcgan.torch.

####Setup Instructions

You will need to install Torch, CuDNN, stnbhwd and the display package.
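Before training, it can help to confirm that Torch can load each dependency. The following is a minimal sketch, not part of the repository: the helper names and the `TH_BIN` variable are hypothetical, and the Lua module names (`cudnn`, `stn` for stnbhwd, `display`) are assumptions about how the packages are usually loaded.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with this code): report which Lua
# packages the Torch interpreter can load.
# TH_BIN is an assumption introduced here so the interpreter can be overridden.
TH_BIN="${TH_BIN:-th}"

check_torch_pkg() {
    # Succeeds (exit status 0) if `require '<pkg>'` works in the interpreter.
    "$TH_BIN" -e "require '$1'" >/dev/null 2>&1
}

report_torch_pkgs() {
    # Prints one status line per package; returns 1 if any package is missing.
    missing=0
    for pkg in "$@"; do
        if check_torch_pkg "$pkg"; then
            echo "ok      $pkg"
        else
            echo "missing $pkg"
            missing=1
        fi
    done
    return "$missing"
}

# Example (module names are assumptions; stnbhwd is assumed to load as 'stn'):
#   report_torch_pkgs torch cudnn stn display
```

A package reported as missing only means the `require` call failed in that interpreter; the cause may be a missing install or a broken CUDA setup.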

####How to train a text-to-image model:

1. Download the data including captions, location annotations and pretrained models.
2. Download the birds and humans image data.
3. Modify the CONFIG file to point to your data.
4. Run one of the training scripts, e.g. ./scripts/train_cub_keypoints.sh
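The last two steps can be guarded with a small wrapper that refuses to start training until the files named above are in place. This is a sketch under stated assumptions: `run_training` is a hypothetical function, and it assumes the training scripts are meant to be launched from the repository root, as the `./scripts/...` path suggests.

```shell
#!/bin/sh
# Hypothetical wrapper (not shipped with this code): check that CONFIG and
# the chosen training script exist before launching the script.
run_training() {
    repo_dir="$1"   # path to the checked-out repository
    script="$2"     # e.g. scripts/train_cub_keypoints.sh

    if [ ! -f "$repo_dir/CONFIG" ]; then
        echo "CONFIG not found in $repo_dir" >&2
        return 1
    fi
    if [ ! -f "$repo_dir/$script" ]; then
        echo "training script not found: $script" >&2
        return 1
    fi

    # Assumption: run from the repository root so relative paths resolve.
    (cd "$repo_dir" && "./$script")
}

# Example:
#   run_training "$HOME/path/to/repo" scripts/train_cub_keypoints.sh
```

The wrapper only checks that the files exist; it does not verify that the paths inside CONFIG point to valid data.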

####How to generate samples:

Run ./scripts/run_all_demos.sh
HTML files will be generated with results like the following:

Moving the bird's position via bounding box:

Moving the bird's position via keypoints:

Birds text to image with ground-truth keypoints:

Birds text to image with generated keypoints:

Humans text to image with ground-truth keypoints:

Humans text to image with generated keypoints:

####Citation

If you find this useful, please cite our work as follows:

@inproceedings{reed2016learning,
  title={Learning What and Where to Draw},
  author={Scott Reed and Zeynep Akata and Santosh Mohan and Samuel Tenka and Bernt Schiele and Honglak Lee},
  booktitle={Advances in Neural Information Processing Systems},
  year={2016}
}
