From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement (CVPR'2020)

Last update: Jan 03, 2023

Related tags

Deep Learning CVPR-2020-Semi-Low-Light

Overview

From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement (CVPR'2020)

Wenhan Yang, Shiqi Wang, Yuming Fang, Yue Wang and Jiaying Liu

[Paper Link] [Project Page] [Slides](TBA)[Video](TBA) (CVPR'2020 Poster)

Abstract

Under-exposure introduces a series of visual degradation, i.e. decreased visibility, intensive noise, and biased color, etc. To address these problems, we propose a novel semi-supervised learning approach for low-light image enhancement. A deep recursive band network (DRBN) is proposed to recover a linear band representation of an enhanced normal-light image with paired low/normal-light images, and then obtain an improved one by recomposing the given bands via another learnable linear transformation based on a perceptual quality-driven adversarial learning with unpaired data. The architecture is powerful and flexible to have the merit of training with both paired and unpaired data. On one hand, the proposed network is well designed to extract a series of coarse-to-fine band representations, whose estimations are mutually beneficial in a recursive process. On the other hand, the extracted band representation of the enhanced image in the first stage of DRBN (recursive band learning) bridges the gap between the restoration knowledge of paired data and the perceptual quality preference to real high-quality images. Its second stage (band recomposition) learns to recompose the band representation towards fitting perceptual properties of highquality images via adversarial learning. With the help of this two-stage design, our approach generates the enhanced results with well reconstructed details and visually promising contrast and color distributions. Extensive evaluations demonstrate the superiority of our DRBN.

If you find the resource useful, please cite the following :- )

@InProceedings{Yang_2020_CVPR,
author = {Yang, Wenhan and Wang, Shiqi and Fang, Yuming and Wang, Yue and Liu, Jiaying},
title = {From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2020}
}

Installation:

Clone this repo
Install PyTorch and dependencies from http://pytorch.org
For stage II training, you need to download [VGG16 Model] and put it in DRBL-stage2/src/.
For testing, you can directly run test.sh in DRBL-stage1/src/ and DRBL-stage2/src/.
For training, you can directly run train.sh in DRBL-stage1/src/ and DRBL-stage2/src/.
You can download our dataset here: [Dataset Link] (extracted code: 22im) [Partly updated on 27 March]

（Note: the code is suitable for PyTorch 0.4.1）

Detailed Guidance:

Thank you for your attention!

How could I reproduce the objective evaluation results in Table I in the paper？
You can run sh ./DRBL-stage1/src/test.sh
The 1st stage offers better objective results while the other produces better overall subjective visual quality. In our paper, the methods involved in objective comparisons are not trained with adversarial/quality losses.
Data structure You can see src\data\lowlight.py and src\data\lowlighttest.py for those details in the code of each stage.

In the 1st stage:
hr --> normal-light images, lr --> low-light images
lr and hr are paired.

In the 2nd stage:
hr --> normal-light images, lr --> low-light images
lr and hr are paired.
lrr --> low-light images in the real applications, hq --> high quality dataset
Dataset You can obtain the dataset via: [Dataset Link] (extracted code: 22im) [Partly updated on 27 March]
We introduce these collections here:
a) Our_low: real captured low-light images in LOL for training;
b) Our_normal: real captured normal-light images in LOL for training;
c) Our_low_test: real captured low-light images in LOL for testing;
d) Our_normal_test: real captured normal-light images in LOL for testing;
e) AVA_good_2: the high-quality images selected from the AVA dataset based on the MOS values;
f) Low_real_test_2_rs: real low-light images selected from LIME, NPE, VV, DICM, the typical unpaired low-light testing datasets;
g) Low_degraded: synthetic low-light images in LOL for training;
h) Normal: synthetic normal-light images in LOL for training;
Image number in LOL
LOL: Chen Wei, Wenjing Wang, Wenhan Yang, and Jiaying Liu. "Deep Retinex Decomposition for Low-Light Enhancement", BMVC, 2018. [Baiduyun (extracted code: sdd0)] [Google Drive]
LOL-v2 (the extension work): Wenhan Yang, Haofeng Huang, Wenjing Wang, Shiqi Wang, and Jiaying Liu. "Sparse Gradient Regularized Deep Retinex Network for Robust Low-Light Image Enhancement", TIP, 2021. [Baiduyun (extracted code: l9xm)] [Google Drive]

We use LOL-v2 as it is larger and more diverse. In fact, it is quite unexpected that the work of LOL-v2 is published later than this, which might also bother followers.

I think you can choose which one to follow freely.
Pytorch version
Only 0.4 and 0.41 currently.
If you have to use more advanced versions, which might be constrained to the GPU device types, you might access Wang Hong's github for the idea to replace parts of the dataloader: [New Dataloader]
Why does stage 2 have two branches?
The distributions of LOL and LIME, NPE, VV, DICM are quite different.
We empirically found that it will lead to better performance if two models and the corresponding training data are adopted.

Contact

If you have questions, you can contact [email protected]. A timely response is promised, if the email is sent by your affliaton email with your signed name.

From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement (CVPR'2020)

Related tags

Overview

From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement (CVPR'2020)

Abstract

If you find the resource useful, please cite the following :- )

Installation:

Detailed Guidance:

Contact

Owner

Yang Wenhan

Implementation of "Distribution Alignment: A Unified Framework for Long-tail Visual Recognition"(CVPR 2021)

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Learning trajectory representations using self-supervision and programmatic supervision.

Libraries, tools and tasks created and used at DeepMind Robotics.

This project provides the code and datasets for 'CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection', CVPR 2019.

Control-Robot-Arm-using-PS4-Controller - A Robotic Arm based on Raspberry Pi and Arduino that controlled by PS4 Controller

An AI Assistant More Than a Toolkit

Image restoration with neural networks but without learning.

Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch

Simple, but essential Bayesian optimization package

Sum-Product Probabilistic Language

Contrastively Disentangled Sequential Variational Audoencoder

ArcaneGAN by Alex Spirin

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Real-time Object Detection for Streaming Perception, CVPR 2022

Deep learning (neural network) based remote photoplethysmography: how to extract pulse signal from video using deep learning tools

SiT: Self-supervised vIsion Transformer

Summary of related papers on visual attention

Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition

(CVPR2021) ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic