This project helps to colorize grayscale images using multiple exemplars.

Last update: Aug 05, 2022

Overview

Multiple Exemplar-based Deep Colorization (Pytorch Implementation)

Pretrained Model

[Jitendra Chautharia](IIT Jodhpur)^1,3,

Prerequisites

Python 3.6+
Nvidia GPU + CUDA, CuDNN

Installation

First use the following commands to prepare the environment:

conda create -n ColorVid python=3.6
source activate ColorVid
pip install -r requirements.txt

Then, download the pretrained models from this link, unzip the file and place the files into the corresponding folders:

video_moredata_l1 under the checkpoints folder
vgg19_conv.pth and vgg19_gray.pth under the data folder

Data Preparation

In order to colorize your own video, it requires to extract the video frames, and provide a reference image as an example.

Place your Target grayscale image into one folder, e.g., ./exp_sample/target
Place your reference images into another folder, e.g., ./exp_sample/references

If you want to automatically retrieve color images, you can try the retrieval algorithm from this link which will retrieve similar images from the ImageNet dataset. Or you can try this link on your own image database.

Test

python test.py --image-size [image-size] \
               --clip_path [path-to-target-grayscale-image] \
               --ref_path [path-to-reference] \
               --output_path [path-to-output]

We provide several sample video clips with corresponding references. For example, one can colorize one sample legacy video using:

python test.py --clip_path ./exp_sample/target \
               --ref_path ./exp_sample/references \
               --output_path ./exp_sample/output

Note that we use 216*384 images for training, which has aspect ratio of 1:2. During inference, we scale the input to this size and then rescale the output back to the original size.

Train

We also provide training code for reference. The training can be started by running:

python --data_root [root of video samples] \
       --data_root_imagenet [root of image samples] \
       --gpu_ids [gpu ids] \

We do not provide the full video dataset due to the copyright issue. For image samples, we retrieve semantically similar images from ImageNet using this repository. Still, one can refer to our code to understand the detailed procedure of augmenting the image dataset to mimic the video frames.

This project helps to colorize grayscale images using multiple exemplars.

Related tags

Overview

Multiple Exemplar-based Deep Colorization (Pytorch Implementation)

Prerequisites

Installation

Data Preparation

Test

Train

Comparison with State-of-the-Arts

Owner

jitendra chautharia

PyTorch implementation of DUL (Data Uncertainty Learning in Face Recognition, CVPR2020)

Image Captioning using CNN ,LSTM and Attention

Multimodal Co-Attention Transformer (MCAT) for Survival Prediction in Gigapixel Whole Slide Images

Measures input lag without dedicated hardware, performing motion detection on recorded or live video

UnsupervisedR&R: Unsupervised Pointcloud Registration via Differentiable Rendering

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

Affine / perspective transformation in Pose Estimation with Tensorflow 2

Using deep learning to predict gene structures of the coding genes in DNA sequences of Arabidopsis thaliana

Code for the ICML 2021 paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Unconstrained Text Detection with Box Supervisionand Dynamic Self-Training

GPU-accelerated PyTorch implementation of Zero-shot User Intent Detection via Capsule Neural Networks

Official repository for "PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation"

FluidNet re-written with ATen tensor lib

Model that predicts the probability of a Twitter user being anti-vaccination.

Unified tracking framework with a single appearance model

Official implementation of NeuralFusion: Online Depth Map Fusion in Latent Space

A PyTorch-based R-YOLOv4 implementation which combines YOLOv4 model and loss function from R3Det for arbitrary oriented object detection.

Official code for 'Robust Siamese Object Tracking for Unmanned Aerial Manipulator' and offical introduction to UAMT100 benchmark

This repository contains the code for TACL2021 paper: SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Classify the disease status of a plant given an image of a passion fruit