SMIS - Semantically Multi-modal Image Synthesis(CVPR 2020)

Last update: Dec 01, 2022

Related tags

Deep Learning SMIS

Overview

Semantically Multi-modal Image Synthesis

Project page / Paper / Demo

Semantically Multi-modal Image Synthesis(CVPR2020).
Zhen Zhu, Zhiliang Xu, Ansheng You, Xiang Bai

Requirements

torch>=1.0.0
torchvision
dominate
dill
scikit-image
tqdm
opencv-python

Getting Started

Data Preperation

DeepFashion
Note: We provide an example of the DeepFashion dataset. That is slightly different from the DeepFashion used in our paper due to the impact of the COVID-19.

Cityscapes
The Cityscapes dataset can be downloaded at here

ADE20K
The ADE20K dataset can be downloaded at here

Test/Train the models

Download the tar of the pretrained models from the Google Drive Folder. Save it in checkpoints/ and unzip it. There are deepfashion.sh, cityscapes.sh and ade20k.sh in the scripts folder. Change the parameters like --dataroot and so on, then comment or uncomment some code to test/train model. And you can specify the --test_mask for SMIS test.

Acknowledgments

Our code is based on the popular SPADE

SMIS - Semantically Multi-modal Image Synthesis(CVPR 2020)

Related tags

Overview

Semantically Multi-modal Image Synthesis

Project page / Paper / Demo

Requirements

Getting Started

Data Preperation

Test/Train the models

Acknowledgments

Owner

This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

Neural Oblivious Decision Ensembles

Simple transformer model for CIFAR10

Aws-machine-learning-university-accelerated-tab - Machine Learning University: Accelerated Tabular Data Class

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code

Registration Loss Learning for Deep Probabilistic Point Set Registration

ivadomed is an integrated framework for medical image analysis with deep learning.

GNEE - GAT Neural Event Embeddings

PHOTONAI is a high level python API for designing and optimizing machine learning pipelines.

A configurable, tunable, and reproducible library for CTR prediction

Evolution Strategies in PyTorch

Code for the published paper : Learning to recognize rare traffic sign

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

This repository contains numerical implementation for the paper Intertemporal Pricing under Reference Effects: Integrating Reference Effects and Consumer Heterogeneity.

Fuzzy Overclustering (FOC)

a delightful machine learning tool that allows you to train, test and use models without writing code

Optical machine for senses sensing using speckle and deep learning

Reliable probability face embeddings

Official implementation of paper Gradient Matching for Domain Generalization

This library is a location of the LegacyLogger for PyTorch Lightning.