Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

Last update: Jan 06, 2023

Overview

Splicing ViT Features for Semantic Appearance Transfer [Project Page]

Splice is a method for semantic appearance transfer, as described in Splicing ViT Features for Semantic Appearance Transfer (link to paper).

Given two input images—a source structure image and a target appearance image–our method generates a new image in which the structure of the source image is preserved, while the visual appearance of the target image is transferred in a semantically aware manner. That is, objects in the structure image are “painted” with the visual appearance of semantically related objects in the appearance image. Our method leverages a self-supervised, pre-trained ViT model as an external semantic prior. This allows us to train our generator only on a single input image pair, without any additional information (e.g., segmentation/correspondences), and without adversarial training. Thus, our framework can work across a variety of objects and scenes, and can generate high quality results in high resolution (e.g., HD).

Getting Started

Installation

git clone https://github.com/omerbt/Splice.git
pip install -r requirements.txt

Run examples

Run the following command to start training

python train.py --dataroot datasets/cows

Intermediate results will be saved to /out/output.png during optimization. The frequency of saving intermediate results is indicated in the save_epoch_freq flag of the configuration.

Sample Results

Citation

@article{Splice2022,
    author = {Tumanyan, Narek
              and Bar-Tal, Omer
              and Bagon, Shai
              and Dekel, Tali
              },
    title = {Splicing ViT Features for Semantic Appearance Transfer}, 
    journal = {arXiv preprint arXiv:2201.00424},
    year  = {2022}
}

Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

Related tags

Overview

Splicing ViT Features for Semantic Appearance Transfer [Project Page]

Getting Started

Installation

Run examples

Sample Results

Citation

Owner

Omer Bar Tal

Datasets, Transforms and Models specific to Computer Vision

pytorchのスライス代入操作をonnxに変換する際にScatterNDならないようにするサンプル

Unsupervised Image Generation with Infinite Generative Adversarial Networks

A new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.

for a paper about leveraging discourse markers for training new models

Discovering Dynamic Salient Regions with Spatio-Temporal Graph Neural Networks

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Towards Improving Embedding Based Models of Social Network Alignment via Pseudo Anchors

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.

A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).

A deep learning model for style-specific music generation.

PyTorch DepthNet Training on Still Box dataset

Official Pytorch implementation for video neural representation (NeRV)

code for ICCV 2021 paper 'Generalized Source-free Domain Adaptation'

MLJetReconstruction - using machine learning to reconstruct jets for CMS

Deep Learning Head Pose Estimation using PyTorch.

Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning"

A collection of implementations of deep domain adaptation algorithms

RuDOLPH: One Hyper-Modal Transformer can be creative as DALL-E and smart as CLIP