Implementation for HFGI: High-Fidelity GAN Inversion for Image Attribute Editing

Last update: Dec 30, 2022

Overview

HFGI: High-Fidelity GAN Inversion for Image Attribute Editing

High-Fidelity GAN Inversion for Image Attribute Editing

Update: We released the inference code and the pre-trained model on Oct. 31. The training code is coming soon.

Introduction

We present a novel high-fidelity GAN inversion framework that enables attribute editing with image-specific details well-preserved (e.g., background, appearance and illumination).

To Do

Release the inference code
Release the pretrained model
Release the training code (upon approval)

Set up

Installation

git clone https://github.com/Tengfei-Wang/HFGI.git
cd HFGI

Environment

The environment can be simply set up by Anaconda (only tested for inference):

conda create -n HFGI python=3.7
conda activate HFGI
pip install torch==1.6.0+cu101 torchvision==0.7.0+cu101 -f https://download.pytorch.org/whl/torch_stable.html
pip install matplotlib
conda install ninja
conda install -c 3dhubs gcc-5

Or, you can also set up the environment from the provided environment.yml:

conda env create -f environment.yml

Quick Start

Pretrained Models

Please download our pre-trained model and put it in ./checkpoint.

Model	Description
Face Editing	Trained on FFHQ.

Prepare Images

We put some images from CelebA-HQ in ./test_imgs, and you can quickly try them (and other images from CelebA-HQ or FFHQ).
For customized images, it is encouraged to first pre-process (align & crop) them, and then edit with our model. See FFHQ for alignment details.

Inference

Modify inference.sh according to the follwing instructions, and run:
(It is possibly slow for the first-time running.)

bash inference.sh

Args	Description
--images_dir	the path of images.
--n_sample	number of images that you want to infer.
--edit_attribute	We provide options of 'inversion', 'age', 'smile', 'eyes', 'lip' and 'beard' in the script.
--edit_degree	control the degree of editing (works for 'age' and 'smile').

Training

Coming soon

Video Editing

The source videos and edited results in our paper can be found in this link.
For video editing, we first pre-process (align & crop) each frame, and then perform editing with the pre-trained model.

More Results

Citation

If you find this work useful for your research, please cite:

@article{wang2021HFGI,
      author = {Tengfei Wang and Yong Zhang and Yanbo Fan and Jue Wang and Qifeng Chen},
      title = {High-Fidelity GAN Inversion for Image Attribute Editing}, 
      journal = {arxiv:2109.06590},  
      year = {2021}
}

Implementation for HFGI: High-Fidelity GAN Inversion for Image Attribute Editing

Related tags

Overview

HFGI: High-Fidelity GAN Inversion for Image Attribute Editing

Introduction

To Do

Set up

Installation

Environment

Quick Start

Pretrained Models

Prepare Images

Inference

Training

Video Editing

More Results

Citation

Owner

Tengfei Wang

A little Python application to auto tag your photos with the power of machine learning.

A pure PyTorch implementation of the loss described in "Online Segment to Segment Neural Transduction"

Pytorch GUI(demo) for iVOS(interactive VOS) and GIS (Guided iVOS)

Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]

Tool for live presentations using manim

[AAAI 2022] Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

An Industrial Grade Federated Learning Framework

Official codebase for Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World

Instance-wise Feature Importance in Time (FIT)

Graph Robustness Benchmark: A scalable, unified, modular, and reproducible benchmark for evaluating the adversarial robustness of Graph Machine Learning.

fklearn: Functional Machine Learning

Deep Probabilistic Programming Course @ DIKU

Cross-Task Consistency Learning Framework for Multi-Task Learning

AI grand challenge 2020 Repo (Speech Recognition Track)

A self-supervised 3D representation learning framework named viewpoint bottleneck.

Official repository for the paper F, B, Alpha Matting

Attention-based Transformation from Latent Features to Point Clouds (AAAI 2022)

Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (Local-Lip)

Fast and robust clustering of point clouds generated with a Velodyne sensor.