AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)

Last update: Jan 03, 2023

Overview

AOT-GAN for High-Resolution Image Inpainting

Arxiv Paper |

AOT-GAN: Aggregated Contextual Transformations for High-Resolution Image Inpainting
Yanhong Zeng, Jianlong Fu, Hongyang Chao, and Baining Guo.

Citation

If any part of our paper and code is helpful to your work, please generously cite and star us 😘 😘 😘 !

@inproceedings{yan2021agg,
  author = {Zeng, Yanhong and Fu, Jianlong and Chao, Hongyang and Guo, Baining},
  title = {Aggregated Contextual Transformations for High-Resolution Image Inpainting},
  booktitle = {Arxiv},
  pages={-},
  year = {2020}
}

Introduction

Despite some promising results, it remains challenging for existing image inpainting approaches to fill in large missing regions in high resolution images (e.g., 512x512). We analyze that the difﬁculties mainly drive from simultaneously inferring missing contents and synthesizing fine-grained textures for a extremely large missing region. We propose a GAN-based model that improves performance by,

Enhancing context reasoning by AOT Block in the generator. The AOT blocks aggregate contextual transformations with different receptive fields, allowing to capture both informative distant contexts and rich patterns of interest for context reasoning.
Enhancing texture synthesis by SoftGAN in the discriminator. We improve the training of the discriminator by a tailored mask-prediction task. The enhanced discriminator is optimized to distinguish the detailed appearance of real and synthesized patches, which can in turn facilitate the generator to synthesize more realistic textures.

Results

Prerequisites

python 3.8.8
pytorch (tested on Release 1.8.1)

Installation

Clone this repo.

git clone [email protected]:researchmm/AOT-GAN-for-Inpainting.git
cd AOT-GAN-for-Inpainting/

For the full set of required Python packages, we suggest create a Conda environment from the provided YAML, e.g.

conda env create -f environment.yml 
conda activate inpainting

Datasets

download images and masks
specify the path to training data by --dir_image and --dir_mask.

Getting Started

Training:
- Our codes are built upon distributed training with Pytorch.
- Run
```
cd src 
python train.py  
```
Resume training:
```
cd src
python train.py --resume 
```

Testing:

cd src 
python test.py --pre_train [path to pretrained model]

Evaluating:

cd src 
python eval.py --real_dir [ground truths] --fake_dir [inpainting results] --metric mae psnr ssim fid

Pretrained models

CELEBA-HQ | Places2

Download the model dirs and put it under experiments/

Demo

Download the pre-trained model parameters and put it under experiments/
Run by

cd src
python demo.py --dir_image [folder to images]  --pre_train [path to pre_trained model] --painter [bbox|freeform]

Press '+' or '-' to control the thickness of painter.
Press 'r' to reset mask; 'k' to keep existing modifications; 's' to save results.
Press space to perform inpainting; 'n' to move to next image; 'Esc' to quit demo.

TensorBoard

Visualization on TensorBoard for training is supported.

Run tensorboard --logdir [log_folder] --bind_all and open browser to view training progress.

Acknowledgements

We would like to thank edge-connect, EDSR_PyTorch.

AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)

Related tags

Overview

AOT-GAN for High-Resolution Image Inpainting

Arxiv Paper |

Citation

Introduction

Results

Prerequisites

Installation

Datasets

Getting Started

Pretrained models

Demo

TensorBoard

Acknowledgements

Owner

Multimedia Research

Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)

Deformable DETR is an efficient and fast-converging end-to-end object detector.

This repository contains the code for TACL2021 paper: SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Vignette is a face tracking software for characters using osu!framework.

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation, CVPR2022

Multi-Scale Geometric Consistency Guided Multi-View Stereo

Pytorch implementation of OCNet series and SegFix.

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Official repository for the CVPR 2021 paper "Learning Feature Aggregation for Deep 3D Morphable Models"

The implement of papar "Enhanced Graph Learning for Collaborative Filtering via Mutual Information Maximization"

MaRS - a recursive filtering framework that allows for truly modular multi-sensor integration

pyspark🍒🥭 is delicious，just eat it!😋😋

This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure recognition.

FaceAPI: AI-powered Face Detection & Rotation Tracking, Face Description & Recognition, Age & Gender & Emotion Prediction for Browser and NodeJS using TensorFlow/JS

A script helps the user to update Linux and Mac systems through the terminal

PyTorch implementation of Barlow Twins.

A Python Package For System Identification Using NARMAX Models

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

Monitora la qualità della ricezione dei segnali radio nelle province siciliane.

transfer attack; adversarial examples; black-box attack; unrestricted Adversarial Attacks on ImageNet; CVPR2021 天池黑盒竞赛