Breaking the Dilemma of Medical Image-to-image Translation

Supervised Pix2Pix and unsupervised Cycle-consistency are two modes that dominate the field of medical image-to-image translation. However, neither modes are ideal. The Pix2Pix mode has excellent performance. But it requires paired and well pixel-wise aligned images, which may not always be achievable due to respiratory motion or anatomy change between times that paired images are acquired. The Cycle-consistency mode is less stringent with training data and works well on unpaired or misaligned images. But its performance may not be optimal. In order to break the dilemma of the existing modes, we propose a new unsupervised mode called RegGAN for medical image-to-image translation. It is based on the theory of "loss-correction". In RegGAN, the misaligned target images are considered as noisy labels and the generator is trained with an additional registration network to fit the misaligned noise distribution adaptively. The goal is to search for the common optimal solution to both image-to-image translation and registration tasks. We incorporated RegGAN into a few state-of-the-art image-to-image translation methods and demonstrated that RegGAN could be easily combined with these methods to improve their performances. Such as a simple CycleGAN in our mode surpasses latest NICEGAN even though using less network parameters. Based on our results, RegGAN outperformed both Pix2Pix on aligned data and Cycle-consistency on misaligned or unpaired data. RegGAN is insensitive to noises which makes it a better choice for a wide range of scenarios, especially for medical image-to-image translation tasks in which well pixel-wise aligned data are not available

This paper has been accepted by NeurIPS 2021. Get the full paper on Arxiv.

Breaking the Dilemma of Medical Image-to-image Translation

Related tags

Overview

Breaking the Dilemma of Medical Image-to-image Translation

Owner

Kid Liet

Free-duolingo-plus - Duolingo account creator that uses your invite code to get you free duolingo plus

A working implementation of the Categorical DQN (Distributional RL).

SuMa++: Efficient LiDAR-based Semantic SLAM (Chen et al IROS 2019)

SpeechNAS Better Trade off between Latency and Accuracy for Large Scale Speaker Verification

Code for the paper "M2m: Imbalanced Classification via Major-to-minor Translation" (CVPR 2020)

TextureGAN in Pytorch

SimDeblur is a simple framework for image and video deblurring, implemented by PyTorch

Detectron2 for Document Layout Analysis

Wind Speed Prediction using LSTMs in PyTorch

PyTorch implementation of the YOLO (You Only Look Once) v2

Workshop Materials Delivered on 28/02/2022

Learning Features with Parameter-Free Layers (ICLR 2022)

Generative Handwriting using LSTM Mixture Density Network with TensorFlow

Code for reproducing experiments in "Improved Training of Wasserstein GANs"

This framework implements the data poisoning method found in the paper Adversarial Examples Make Strong Poisons

Simple, efficient and flexible vision toolbox for mxnet framework.

This repository is the official implementation of the Hybrid Self-Attention NEAT algorithm.

Code for "Intra-hour Photovoltaic Generation Forecasting based on Multi-source Data and Deep Learning Methods."

Revealing and Protecting Labels in Distributed Training

TensorFlow implementation of the paper "Hierarchical Attention Networks for Document Classification"