EqGAN - Improving GAN Equilibrium by Raising Spatial Awareness

Improving GAN Equilibrium by Raising Spatial Awareness
Jianyuan Wang, Ceyuan Yang, Yinghao Xu, Yujun Shen, Hongdong Li, Bolei Zhou
arXiv preprint


In Generative Adversarial Networks (GANs), a generator (G) and a discriminator (D) are expected to reach an equilibrium where D cannot distinguish the generated images from the real ones. In practice, however, such an equilibrium is difficult to achieve in GAN training; instead, D almost always surpasses G. We attribute this phenomenon to an information asymmetry: D learns its own visual attention when determining whether an image is real or fake, but G has no explicit clue about which regions to focus on.

To alleviate the issue of D dominating the competition in GANs, we aim to raise the spatial awareness of G. We encode randomly sampled multi-level heatmaps into the intermediate layers of G as an inductive bias. We further propose to align the spatial awareness of G with the attention map induced from D. In this way, we effectively narrow the information gap between D and G. Extensive results show that our method pushes the two-player game in GANs closer to equilibrium, leading to better synthesis performance. As a byproduct, the introduced spatial awareness facilitates interactive editing of the synthesized output.
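
The two ideas above (multi-level heatmaps fed to G, and alignment with D's attention) can be illustrated with a small NumPy sketch. This is not the released implementation; it is a minimal illustration under stated assumptions. The Gaussian heatmap shape, the multiplicative modulation in `modulate`, the mean-squared-error `alignment_loss`, and every function name and default value here are hypothetical, and the random array standing in for D's attention map is a placeholder only.

```python
import numpy as np


def sample_heatmap(size, centers, sigma):
    """Render a (size, size) heatmap as the pixel-wise max of 2D Gaussians.

    `centers` holds (row, col) positions in [0, 1]; `sigma` is relative to
    the image side. Gaussian blobs are an assumption made for this sketch.
    """
    ys, xs = np.meshgrid(
        np.linspace(0.0, 1.0, size), np.linspace(0.0, 1.0, size), indexing="ij"
    )
    heatmap = np.zeros((size, size))
    for cy, cx in centers:
        blob = np.exp(-((ys - cy) ** 2 + (xs - cx) ** 2) / (2.0 * sigma ** 2))
        heatmap = np.maximum(heatmap, blob)
    return heatmap


def sample_multilevel_heatmaps(rng, resolutions=(4, 8, 16, 32),
                               num_centers=3, sigma=0.15):
    """Sample the centers once, then render them at every resolution."""
    centers = rng.uniform(0.0, 1.0, size=(num_centers, 2))
    return {res: sample_heatmap(res, centers, sigma) for res in resolutions}


def modulate(features, heatmap):
    """Inject a heatmap into a (C, H, W) feature map.

    Scaling by (1 + heatmap) is a hypothetical encoding chosen for
    illustration; the actual encoding may differ.
    """
    return features * (1.0 + heatmap[None, :, :])


def alignment_loss(g_heatmap, d_attention):
    """MSE between the min-max normalised heatmap of G and attention map of D."""
    def normalise(m):
        m = m - m.min()
        return m / (m.max() + 1e-8)

    return float(np.mean((normalise(g_heatmap) - normalise(d_attention)) ** 2))


rng = np.random.default_rng(0)
heatmaps = sample_multilevel_heatmaps(rng)

# Stand-in for an intermediate 16x16 feature map of G with 8 channels.
features = rng.standard_normal((8, 16, 16))
modulated = modulate(features, heatmaps[16])

# Placeholder for the attention map induced from D (random values here).
d_attention = rng.uniform(size=(16, 16))
loss = alignment_loss(heatmaps[16], d_attention)
```

Sampling the centers once and rendering them at each resolution keeps the heatmaps spatially consistent across levels, which is one plausible reading of "multi-level"; in a training loop, a loss like `alignment_loss` would be added to the objective of G.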

BibTeX

@article{wang2021eqgan,
  title   = {Improving GAN Equilibrium by Raising Spatial Awareness},
  author  = {Wang, Jianyuan and Yang, Ceyuan and Xu, Yinghao and Shen, Yujun and Li, Hongdong and Zhou, Bolei},
  journal = {arXiv preprint},
  year    = {2021}
}
