Breaking the Dilemma of Medical Image-to-image Translation

Supervised Pix2Pix and unsupervised Cycle-consistency are two modes that dominate the field of medical image-to-image translation. However, neither modes are ideal. The Pix2Pix mode has excellent performance. But it requires paired and well pixel-wise aligned images, which may not always be achievable due to respiratory motion or anatomy change between times that paired images are acquired. The Cycle-consistency mode is less stringent with training data and works well on unpaired or misaligned images. But its performance may not be optimal. In order to break the dilemma of the existing modes, we propose a new unsupervised mode called RegGAN for medical image-to-image translation. It is based on the theory of "loss-correction". In RegGAN, the misaligned target images are considered as noisy labels and the generator is trained with an additional registration network to fit the misaligned noise distribution adaptively. The goal is to search for the common optimal solution to both image-to-image translation and registration tasks. We incorporated RegGAN into a few state-of-the-art image-to-image translation methods and demonstrated that RegGAN could be easily combined with these methods to improve their performances. Such as a simple CycleGAN in our mode surpasses latest NICEGAN even though using less network parameters. Based on our results, RegGAN outperformed both Pix2Pix on aligned data and Cycle-consistency on misaligned or unpaired data. RegGAN is insensitive to noises which makes it a better choice for a wide range of scenarios, especially for medical image-to-image translation tasks in which well pixel-wise aligned data are not available

This paper has been accepted by NeurIPS 2021. Get the full paper on Arxiv.

Breaking the Dilemma of Medical Image-to-image Translation

Related tags

Overview

Breaking the Dilemma of Medical Image-to-image Translation

Owner

Kid Liet

Codebase for arXiv preprint "NeRF++: Analyzing and Improving Neural Radiance Fields"

Experiments and examples converting Transformers to ONNX

A hybrid framework (neural mass model + ML) for SC-to-FC prediction

DeepProbLog is an extension of ProbLog that integrates Probabilistic Logic Programming with deep learning by introducing the neural predicate.

PyTorch implementation for View-Guided Point Cloud Completion

🎁 3,000,000+ Unsplash images made available for research and machine learning

(SIGIR2020) “Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback’’

mmdetection version of TinyBenchmark.

Compute descriptors for 3D point cloud registration using a multi scale sparse voxel architecture

PrimitiveNet: Primitive Instance Segmentation with Local Primitive Embedding under Adversarial Metric (ICCV 2021)

Solving Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge

Generative Query Network (GQN) in PyTorch as described in "Neural Scene Representation and Rendering"

AI Flow is an open source framework that bridges big data and artificial intelligence.

Python codes for Lite Audio-Visual Speech Enhancement.

This repo includes the CUB-GHA (Gaze-based Human Attention) dataset and code of the paper "Human Attention in Fine-grained Classification".

Example how to deploy deep learning model with aiohttp.

A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal

PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)

PyTorch implementation of SCAFFOLD (Stochastic Controlled Averaging for Federated Learning, ICML 2020).

Graph Representation Learning via Graphical Mutual Information Maximization