Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Last update: Jan 07, 2023

Related tags

Overview

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

The following results are obtained by our SCUNet with purely synthetic training data! We did not use the paired noisy/clean data by DND and SIDD during training!

Swin-Conv-UNet (SCUNet) denoising network

The architecture of the proposed Swin-Conv-UNet (SCUNet) denoising network. SCUNet exploits the swin-conv (SC) block as the main building block of a UNet backbone. In each SC block, the input is first passed through a 1×1 convolution, and subsequently is split evenly into two feature map groups, each of which is then fed into a swin transformer (SwinT) block and residual 3×3 convolutional (RConv) block, respectively; after that, the outputs of SwinT block and RConv block are concatenated and then passed through a 1×1 convolution to produce the residual of the input. “SConv” and “TConv” denote 2×2 strided convolution with stride 2 and 2×2 transposed convolution with stride 2, respectively.

New data synthesis pipeline for real image denoising

Schematic illustration of the proposed paired training patches synthesis pipeline. For a high quality image, a randomly shuffled degradation sequence is performed to produce a noisy image. Meanwhile, the resizing and reverse-forward tone mapping are performed to produce a corresponding clean image. A paired noisy/clean training patches are then cropped for training deep blind denoising model. Note that, since Poisson noise is signal-dependent, the dashed arrow for “Poisson” means the clean image is used to generate the Poisson noise. To tackle with the color shift issue, the dashed arrow for “Camera Sensor” means the reverse-forward tone mapping is performed on the clean image.

Synthesized noisy/clean patch pairs via our proposed training data synthesis pipeline. The size of the high quality image patch is 544×544. The size of the noisy/clean patches is 128×128.

Web Demo

Try Replicate web demo for SCUNet models here

Codes

Download SCUNet models

python main_download_pretrained_models.py --models "SCUNet" --model_dir "model_zoo"

Gaussian denoising

grayscale images

python main_test_scunet_gray_gaussian.py --model_name scunet_gray_25 --noise_level_img 25 --testset_name set12

color images

python main_test_scunet_color_gaussian.py --model_name scunet_color_25 --noise_level_img 25 --testset_name bsd68

Blind real image denoising

python main_test_scunet_real_application.py --model_name scunet_color_real_psnr --testset_name real3

Results on Gaussian denoising

Results on real image denoising

@article{zhang2022practical,
title={Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis},
author={Zhang, Kai and Li, Yawei and Liang, Jingyun and Cao, Jiezhang and Zhang, Yulun and Tang, Hao and Timofte, Radu and Van Gool, Luc},
journal={arXiv preprint},
year={2022}
}

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Related tags

Overview

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis

Swin-Conv-UNet (SCUNet) denoising network

New data synthesis pipeline for real image denoising

Web Demo

Codes

Results on Gaussian denoising

Results on real image denoising

Owner

Kai Zhang

This repository contains the code for the paper "Hierarchical Motion Understanding via Motion Programs"

MediaPipeのPythonパッケージのサンプルです。2020/12/11時点でPython実装のある4機能(Hands、Pose、Face Mesh、Holistic)について用意しています。

免费获取http代理并生成proxifier配置文件

Get 2D point positions (e.g., facial landmarks) projected on 3D mesh

LibMTL: A PyTorch Library for Multi-Task Learning

Install alphafold on the local machine, get out of docker.

Functional TensorFlow Implementation of Singular Value Decomposition for paper Fast Graph Learning

Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback

Implementations of CNNs, RNNs, GANs, etc

Human Pose Detection on EdgeTPU

Dense Deep Unfolding Network with 3D-CNN Prior for Snapshot Compressive Imaging, ICCV2021 [PyTorch Code]

Tello Drone Trajectory Tracking

Source code of our work: "Benchmarking Deep Models for Salient Object Detection"

Deconfounding Temporal Autoencoder: Estimating Treatment Effects over Time Using Noisy Proxies

Convert human motion from video to .bvh

PyTorch implementation of a collections of scalable Video Transformer Benchmarks.

a Lightweight library for sequential learning agents, including reinforcement learning

NeurIPS 2021 paper 'Representation Learning on Spatial Networks' code

PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]