Official PyTorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

Last update: Sep 26, 2022

Overview

Unsupervised Image Denoising with Frequency Domain Knowledge (BMVC 2021 Oral): Official Project Page

This repository provides the official PyTorch implementation of the following paper:

Unsupervised Image Denoising with Frequency Domain Knowledge

Nahyun Kim* (KAIST), Donggon Jang* (KAIST), Sunhyeok Lee (KAIST), Bomi Kim (KAIST), and Dae-Shik Kim (KAIST) (*The authors have equally contributed.)

BMVC 2021, Accepted as Oral Paper.

Abstract: Supervised learning-based methods yield robust denoising results, yet they are inherently limited by the need for large-scale clean/noisy paired datasets. The use of unsupervised denoisers, on the other hand, necessitates a more detailed understanding of the underlying image statistics. In particular, it is well known that apparent differences between clean and noisy images are most prominent on high-frequency bands, justifying the use of low-pass filters as part of conventional image preprocessing steps. However, most learning-based denoising methods utilize only one-sided information from the spatial domain without considering frequency domain information. To address this limitation, in this study we propose a frequency-sensitive unsupervised denoising method. To this end, a generative adversarial network (GAN) is used as a base structure. Subsequently, we include spectral discriminator and frequency reconstruction loss to transfer frequency knowledge into the generator. Results using natural and synthetic datasets indicate that our unsupervised learning method augmented with frequency information achieves state-of-the-art denoising performance, suggesting that frequency domain information could be a viable factor in improving the overall performance of unsupervised learning-based methods.
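The abstract's observation that clean and noisy images differ most in high-frequency bands can be illustrated with a small 1-D experiment. This is only a sketch and not the paper's spectral discriminator or frequency reconstruction loss: the signal, the cutoff, and the helper names (dft_magnitudes, high_freq_energy) are assumptions made for the illustration.

```python
import cmath
import math
import random


def dft_magnitudes(signal):
    """Magnitudes of the non-negative-frequency DFT bins (naive O(n^2) DFT)."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal)))
        for k in range(n // 2 + 1)
    ]


def high_freq_energy(signal, cutoff_fraction=0.5):
    """Sum of squared magnitudes in the upper part of the spectrum.

    cutoff_fraction is a hypothetical parameter chosen for this demo.
    """
    mags = dft_magnitudes(signal)
    cutoff = int(len(mags) * cutoff_fraction)
    return sum(m * m for m in mags[cutoff:])


rng = random.Random(0)  # fixed seed so the demo is repeatable
n = 64

# A smooth "clean" signal: a constant offset plus one slow sine cycle.
clean = [128 + 50 * math.sin(2 * math.pi * t / n) for t in range(n)]
# The same signal with additive white Gaussian noise (sigma = 25).
noisy = [c + rng.gauss(0, 25) for c in clean]

clean_hf = high_freq_energy(clean)
noisy_hf = high_freq_energy(noisy)
print(f"high-frequency energy, clean: {clean_hf:.3e}")
print(f"high-frequency energy, noisy: {noisy_hf:.3e}")
```

The smooth signal has almost no energy in the upper bins, while white noise spreads energy across all bins, so the gap between the two shows up mainly at high frequencies.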

Requirements

To install requirements:

conda env create -n [your env name] -f environment.yaml
conda activate [your env name]

To train the model

Synthetic Noise (AWGN)

Download the DIV2K dataset for training from here.
Randomly split the DIV2K dataset into Clean and Noisy sets. Please refer to the .txt files in split_data.
Place the split datasets (DIV2K_C and DIV2K_N) in the ./dataset directory.

dataset
└─── DIV2K_C
└─── DIV2K_N
└─── test
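The random Clean/Noisy split described above can be sketched as follows. This is a minimal illustration, not the repository's own procedure: the authors' actual split is the one listed in the .txt files in split_data, and the half/half ratio, the seed, the file names, and the helper name split_clean_noisy are all assumptions.

```python
import random


def split_clean_noisy(filenames, seed=0):
    """Shuffle the file names and split them into two disjoint halves.

    Returns (clean_set, noisy_set). The 50/50 ratio is an assumption.
    """
    names = sorted(filenames)      # sort first so the result depends only on the seed
    rng = random.Random(seed)
    rng.shuffle(names)
    half = len(names) // 2
    return sorted(names[:half]), sorted(names[half:])


# Hypothetical file names used only for this demo.
filenames = [f"{i:04d}.png" for i in range(1, 801)]
clean_set, noisy_set = split_clean_noisy(filenames)
print(len(clean_set), "images for DIV2K_C,", len(noisy_set), "images for DIV2K_N")
```

Keeping the two sets disjoint means no image appears in both a clean and a noisy version, which is what makes the training data unpaired.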

Use gen_dataset_synthetic.py to package the dataset in HDF5 format (via h5py).
Then run the script for the noise level you want:

sh ./scripts/train_awgn_sigma15.sh # AWGN with a noise level = 15
sh ./scripts/train_awgn_sigma25.sh # AWGN with a noise level = 25
sh ./scripts/train_awgn_sigma50.sh # AWGN with a noise level = 50
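For readers unfamiliar with the noise levels above, the sketch below shows what additive white Gaussian noise at sigma = 15, 25, and 50 looks like on pixel values. It assumes that sigma is given on the 0-255 intensity scale, which is the usual convention in denoising benchmarks but is not stated in this README. The clipping step and the helper name add_awgn are also assumptions, and the training scripts may generate noise differently.

```python
import random


def add_awgn(pixels, sigma, seed=None):
    """Add zero-mean Gaussian noise with standard deviation sigma to each pixel.

    Values are clipped to [0, 255]; clipping is an assumption of this sketch.
    """
    rng = random.Random(seed)
    return [min(255.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in pixels]


# A flat mid-gray "image" flattened to a list of pixel values.
gray = [128.0] * 10000

for sigma in (15, 25, 50):
    noisy = add_awgn(gray, sigma, seed=0)
    mean = sum(noisy) / len(noisy)
    std = (sum((v - mean) ** 2 for v in noisy) / len(noisy)) ** 0.5
    print(f"sigma={sigma}: measured std of noisy pixels = {std:.2f}")
```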

After training finishes, the .pth file is stored in the ./exp/[exp_name]/[seed_number]/saved_models/ directory.
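To locate the saved checkpoints afterwards, a short helper like the one below can walk that directory layout. It is a sketch that assumes only the path pattern given above; the helper name find_checkpoints is hypothetical and not part of the repository.

```python
from pathlib import Path


def find_checkpoints(exp_root="./exp"):
    """Return all .pth files under <exp_root>/<exp_name>/<seed_number>/saved_models/."""
    return sorted(Path(exp_root).glob("*/*/saved_models/*.pth"))


checkpoints = find_checkpoints()
for path in checkpoints:
    print(path)
if not checkpoints:
    print("No checkpoints found under ./exp yet.")
```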

Real-World Noise

Download the SIDD-Medium dataset for training from here.
Randomly split the SIDD-Medium dataset into Clean and Noisy sets. Please refer to the .txt files in split_data.
Place the split datasets (SIDD_C and SIDD_N) in the ./dataset directory.

dataset
└─── SIDD_C
└─── SIDD_N
└─── test

Use gen_dataset_real.py to package the dataset in HDF5 format (via h5py).
After that, run this command:

sh ./scripts/train_real.sh

After training finishes, the .pth file is stored in the ./exp/[exp_name]/[seed_number]/saved_models/ directory.

To evaluate the model

Synthetic Noise (AWGN)

Download the CBSD68 dataset for evaluation from here.
Place the dataset in the ./dataset/test directory.

dataset
└─── train
└─── test
     └─── CBSD68
     └─── SIDD_test

Then run the script for the noise level you want:

sh ./scripts/test_awgn_sigma15.sh # AWGN with a noise level = 15
sh ./scripts/test_awgn_sigma25.sh # AWGN with a noise level = 25
sh ./scripts/test_awgn_sigma50.sh # AWGN with a noise level = 50
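This README does not say which metrics the test scripts report. PSNR is a common choice for denoising benchmarks, so the sketch below shows how it is typically computed for 8-bit images. It is an illustration under that assumption and not the repository's evaluation code; the function name psnr and the flat-list input are choices made for the example.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    if len(reference) != len(estimate):
        raise ValueError("inputs must have the same number of pixels")
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical inputs
    return 10.0 * math.log10(max_value ** 2 / mse)


clean = [100.0, 100.0, 100.0, 100.0]
denoised = [110.0, 90.0, 110.0, 90.0]  # every pixel is off by 10, so MSE = 100
print(f"PSNR: {psnr(clean, denoised):.2f} dB")
```

A higher PSNR means the denoised output is closer to the clean reference.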

Real-World Noise

Download the SIDD test dataset for evaluation from here.
Place the dataset in the ./dataset/test directory.

dataset
└─── train
└─── test
     └─── CBSD68
     └─── SIDD_test

After that, run this command:

sh ./scripts/test_real.sh

Pre-trained model

We provide pre-trained models in the ./checkpoints directory.

checkpoints
|   AWGN_sigma15.pth # pre-trained model (AWGN with a noise level = 15)
|   AWGN_sigma25.pth # pre-trained model (AWGN with a noise level = 25)
|   AWGN_sigma50.pth # pre-trained model (AWGN with a noise level = 50)
|   SIDD.pth # pre-trained model (Real-World noise)

Acknowledgements

This code is built on U-GAT-IT, CARN, and SSD-GAN. We thank the authors for sharing their code.

Contact

If you have any questions, feel free to contact me ([email protected])

Owner

Donggon Jang
