A PyTorch implementation of the paper "Semantic Image Synthesis via Adversarial Learning" in ICCV 2017

Last update: Nov 25, 2022

Related tags

Deep Learning dong_iccv_2017

Overview

Semantic Image Synthesis via Adversarial Learning

This is a PyTorch implementation of the paper Semantic Image Synthesis via Adversarial Learning.

Requirements

PyTorch 0.2
Torchvision
Pillow
fastText.py (Note: if you have a problem when loading a pretrained model, try my fixed code)
NLTK

Pretrained word vectors for fastText

Download a pretrained English word vectors. You can see the list of pretrained vectors on this page.

Datasets

Oxford-102 flowers: images and captions
Caltech-200 birds: images and captions

The caption data is from this repository. After downloading, modify CONFIG file so that all paths of the datasets point to the data you downloaded.

Run

scripts/train_text_embedding_[birds/flowers].sh
Train a visual-semantic embedding model using the method of Kiros et al..
scripts/train_[birds/flowers].sh
Train a GAN using a pretrained text embedding model.
scripts/test_[birds/flowers].sh
Generate some examples using original images and semantically relevant texts.

Results

Acknowledgements

We would like to thank Hao Dong, who is one of the first authors of the paper Semantic Image Synthesis via Adversarial Learning, for providing helpful advice for the implementation.

A PyTorch implementation of the paper "Semantic Image Synthesis via Adversarial Learning" in ICCV 2017

Related tags

Overview

Semantic Image Synthesis via Adversarial Learning

Requirements

Pretrained word vectors for fastText

Datasets

Run

Results

Acknowledgements

Owner

Seonghyeon Nam

PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)

A cross-document event and entity coreference resolution system, trained and evaluated on the ECB+ corpus.

Joint Channel and Weight Pruning for Model Acceleration on Mobile Devices

Pytorch implementation of the paper DocEnTr: An End-to-End Document Image Enhancement Transformer.

IPATool-py: download ipa easily

Step by Step on how to create an vision recognition model using LOBE.ai, export the model and run the model in an Azure Function

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight

The source codes for TME-BNA: Temporal Motif-Preserving Network Embedding with Bicomponent Neighbor Aggregation.

TagLab: an image segmentation tool oriented to marine data analysis

Object detection using yolo-tiny model and opencv used as backend

Doods2 - API for detecting objects in images and video streams using Tensorflow

Pytorch Implementation of Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations

Distributionally robust neural networks for group shifts

[TPDS'21] COSCO: Container Orchestration using Co-Simulation and Gradient Based Optimization for Fog Computing Environments

TraSw for FairMOT - A Single-Target Attack example (Attack ID: 19; Screener ID: 24):

CarND-LaneLines-P1 - Lane Finding Project for Self-Driving Car ND

ProMP: Proximal Meta-Policy Search

Python script for performing depth completion from sparse depth and rgb images using the msg_chn_wacv20. model in Tensorflow Lite.

This program uses trial auth token of Azure Cognitive Services to do speech synthesis for you.