PyTorch implementation of fast-neural-style

Last update: Dec 15, 2022

Overview

fast-neural-style 🌇 🚀

NOTICE: This codebase is no longer maintained. Please use the codebase from the PyTorch examples repository, available at pytorch/examples/fast_neural_style.

This repository contains a PyTorch implementation of an algorithm for artistic style transfer. The algorithm mixes the content of one image with the style of another. For example, here is a photograph of a door arch rendered in the style of a stained glass painting.

The model uses the method described in Perceptual Losses for Real-Time Style Transfer and Super-Resolution, along with Instance Normalization. The saved models for the examples shown in the README can be downloaded from here.
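
As a rough illustration of the Instance Normalization step mentioned above, the sketch below normalizes each channel of a single image by that channel's own mean and variance. It is written in plain Python on nested lists for readability; the function names are hypothetical and this is not the code used in the repository, which would rely on PyTorch layers.

```python
import math


def instance_norm(channel, eps=1e-5):
    """Normalize one channel (a 2-D list of floats) to zero mean, unit variance."""
    values = [v for row in channel for v in row]
    mean = sum(values) / len(values)
    # Biased variance: divide by the number of pixels in the channel.
    var = sum((v - mean) ** 2 for v in values) / len(values)
    scale = 1.0 / math.sqrt(var + eps)
    return [[(v - mean) * scale for v in row] for row in channel]


def instance_norm_image(image, eps=1e-5):
    """Apply the normalization independently to every channel of one image."""
    return [instance_norm(channel, eps) for channel in image]
```

Because the statistics are computed per image and per channel, the result does not depend on the other images in a batch.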

DISCLAIMER: This implementation is also part of the PyTorch examples repository. The implementation in this repository uses a pretrained Caffe2 VGG, whereas the PyTorch examples repository implementation uses a pretrained PyTorch VGG. The two VGGs use different preprocessing, which results in different --content-weight and --style-weight parameters. The styled output images also look slightly different.
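
To make the preprocessing difference concrete, here is a minimal sketch of two common VGG input conventions applied to a single RGB pixel. The torchvision constants are the documented ImageNet mean and standard deviation. The Caffe-style convention (BGR order, 0-255 range, mean subtraction only) and its mean values are an assumption about the VGG used here, not something taken from this repository's code.

```python
# torchvision convention: RGB in [0, 1], normalized by ImageNet mean and std.
IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)

# Caffe-style convention (assumed): BGR in [0, 255], mean subtracted, no std.
CAFFE_BGR_MEAN = (103.939, 116.779, 123.68)


def preprocess_torchvision(rgb):
    """RGB pixel in [0, 255] -> RGB scaled to [0, 1], then mean/std normalized."""
    return tuple(
        (c / 255.0 - m) / s for c, m, s in zip(rgb, IMAGENET_MEAN, IMAGENET_STD)
    )


def preprocess_caffe(rgb):
    """RGB pixel in [0, 255] -> BGR order in [0, 255] with the mean subtracted."""
    r, g, b = rgb
    return tuple(c - m for c, m in zip((b, g, r), CAFFE_BGR_MEAN))
```

Under these assumptions the two inputs differ in magnitude by roughly two orders (values of a few units versus values above one hundred), which would plausibly change the scale of the VGG activations and therefore the loss weights that work well.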

Requirements

The program is written in Python and uses PyTorch and SciPy. A GPU is not necessary, but it can provide a significant speedup, especially when training a new model. Regular-sized images can be styled on a laptop or desktop using saved models.

Set up the environment

Run with virtualenv

Create a virtualenv with Python 3.5 or Python 3.6. Older versions are not supported due to a lack of compatibility with PyTorch.

python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Run with Docker

Build the image:

docker build . -t fast-neural-style

Run the container:

docker run --rm --volume "$(pwd)/:/data" fast-neural-style eval --content-image /data/image.jpg --model /app/saved-models/mosaic.pth --output-image /data/output.jpg --cuda 0

Usage

Stylize image

python neural_style/neural_style.py eval --content-image </path/to/content/image> --model </path/to/saved/model> --output-image </path/to/output/image> --cuda 0

--content-image: path to the content image you want to stylize.
--model: saved model to be used for stylizing the image (e.g. mosaic.pth).
--output-image: path for saving the output image.
--content-scale: factor for scaling down the content image if memory is an issue (e.g. a value of 2 will halve the height and width of the content image).
--cuda: set it to 1 for running on GPU, 0 for CPU.
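
The effect of --content-scale described above can be sketched as a small size calculation. The helper name `scaled_size` is hypothetical, and truncating to an integer is an assumption about how the script rounds the result.

```python
def scaled_size(width, height, content_scale=None):
    """Return the (width, height) the content image is resized to.

    A content_scale of None leaves the size unchanged; a value of 2 halves
    both dimensions, as described for --content-scale.
    """
    if content_scale is None:
        return width, height
    if content_scale <= 0:
        raise ValueError("content_scale must be positive")
    # Truncation toward zero is an assumption about the rounding behaviour.
    return int(width / content_scale), int(height / content_scale)
```

For instance, `scaled_size(1024, 768, 2)` gives `(512, 384)`, so the image holds a quarter of the original pixels, which is what reduces memory use.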

Train model

python neural_style/neural_style.py train --dataset </path/to/train-dataset> --style-image </path/to/style/image> --vgg-model-dir </path/to/vgg/folder> --save-model-dir </path/to/save-model/folder> --epochs 2 --cuda 1

There are several command line arguments. The important ones are listed below:

--dataset: path to the training dataset. The path should point to a folder containing another folder with all the training images. I used the COCO 2014 Training images dataset [80K/13GB] (download).
--style-image: path to the style image.
--vgg-model-dir: path to the folder where the VGG model will be downloaded.
--save-model-dir: path to the folder where the trained model will be saved.
--cuda: set it to 1 for running on GPU, 0 for CPU.

Refer to neural_style/neural_style.py for other command line arguments.
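
The folder layout expected by --dataset (a folder containing another folder with the images) is easy to get wrong, so a quick check before starting a long training run may help. The sketch below uses only the standard library; the function names and the list of image suffixes are hypothetical and not part of this repository.

```python
from pathlib import Path

# Suffixes treated as images here; an assumption for illustration only.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def image_subfolders(dataset_root):
    """Return {subfolder name: image count} for direct subfolders holding images."""
    root = Path(dataset_root)
    counts = {}
    for sub in sorted(p for p in root.iterdir() if p.is_dir()):
        n = sum(
            1
            for f in sub.iterdir()
            if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
        )
        if n:
            counts[sub.name] = n
    return counts


def check_dataset_layout(dataset_root):
    """Raise ValueError if no direct subfolder of dataset_root contains images."""
    counts = image_subfolders(dataset_root)
    if not counts:
        raise ValueError(
            f"{dataset_root} has no subfolder with images; "
            "--dataset should point to the parent of the image folder"
        )
    return counts
```

Images placed directly in the dataset root are ignored by this check, mirroring the requirement that the images sit one level below the path passed to --dataset.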

Models

Models for the examples shown below can be downloaded from here or by running the script download_styling_models.sh.


Owner

Abhishek Kadian
