Pytorch implementation of the paper Improving Text-to-Image Synthesis Using Contrastive Learning

Last update: Dec 31, 2022

Related tags

Deep Learning T2I_CL

Overview

T2I_CL

This is the official Pytorch implementation of the paper Improving Text-to-Image Synthesis Using Contrastive Learning

Requirements

Linux
Python ≥ 3.6
PyTorch ≥ 1.4.0

Prepare Data

Download the preprocessed datasets from AttnGAN

Alternatively, another site is from DM-GAN

Training

Pretrain DAMSM+CL:
- For bird dataset: python pretrain_DAMSM.py --cfg cfg/DAMSM/bird.yml --gpu 0
- For coco dataset: python pretrain_DAMSM.py --cfg cfg/DAMSM/coco.yml --gpu 0
Train AttnGAN+CL:
- For bird dataset: python main.py --cfg cfg/bird_attn2.yml --gpu 0
- For coco dataset: python main.py --cfg cfg/coco_attn2.yml --gpu 0
Train DM-GAN+CL:
- For bird dataset: python main.py --cfg cfg/bird_DMGAN.yml --gpu 0
- For coco dataset: python main.py --cfg cfg/coco_DMGAN.yml --gpu 0

Pretrained Models

DAMSM+CL for bird. Download and save it to DAMSMencoders/
DAMSM+CL for coco. Download and save it to DAMSMencoders/
AttnGAN+CL for bird. Download and save it to models/
AttnGAN+CL for coco. Download and save it to models/
DM-GAN+CL for bird. Download and save it to models/
DM-GAN+CL for coco. Download and save it to models/

Evaluation

Sampling and get the R-precision:
- python main.py --cfg cfg/eval_bird.yml --gpu 0
- python main.py --cfg cfg/eval_coco.yml --gpu 0
Inception score:
- python inception_score_bird.py --image_folder fake_images_bird
- python inception_score_coco.py fake_images_coco
FID:
- python fid_score.py --gpu 0 --batch-size 50 --path1 real_images_bird --path2 fake_images_bird
- python fid_score.py --gpu 0 --batch-size 50 --path1 real_images_coco --path2 fake_images_coco

Citation

If you find this work useful in your research, please consider citing:

@article{ye2021improving,
  title={Improving Text-to-Image Synthesis Using Contrastive Learning},
  author={Ye, Hui and Yang, Xiulong and Takac, Martin and Sunderraman, Rajshekhar and Ji, Shihao},
  journal={arXiv preprint arXiv:2107.02423},
  year={2021}
}

Acknowledge

Our work is based on the following works:

Pytorch implementation of the paper Improving Text-to-Image Synthesis Using Contrastive Learning

Related tags

Overview

T2I_CL

Requirements

Prepare Data

Training

Pretrained Models

Evaluation

Citation

Acknowledge

Owner

Weakly Supervised End-to-End Learning (NeurIPS 2021)

Codes for CyGen, the novel generative modeling framework proposed in "On the Generative Utility of Cyclic Conditionals" (NeurIPS-21)

The official project of SimSwap (ACM MM 2020)

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

Implementation of the method proposed in the paper "Neural Descriptor Fields: SE(3)-Equivariant Object Representations for Manipulation"

Code release for NeuS

Controlling the MicriSpotAI robot from scratch

Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

Compressed Video Action Recognition

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

Python Classes: Medical Insurance Project using Object Oriented Programming Concepts

Code for our EMNLP 2021 paper “Heterogeneous Graph Neural Networks for Keyphrase Generation”

Code for the ICML 2021 paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

v objective diffusion inference code for PyTorch.

Official Code Implementation of the paper : XAI for Transformers: Better Explanations through Conservative Propagation

QR2Pass-project - A proof of concept for an alternative (passwordless) authentication system to a web server

Everything you want about DP-Based Federated Learning, including Papers and Code. (Mechanism: Laplace or Gaussian, Dataset: femnist, shakespeare, mnist, cifar-10 and fashion-mnist. )

COVINS -- A Framework for Collaborative Visual-Inertial SLAM and Multi-Agent 3D Mapping

CR-Fill: Generative Image Inpainting with Auxiliary Contextual Reconstruction. ICCV 2021

A repository for the paper "Improved Adversarial Systems for 3D Object Generation and Reconstruction".