VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware images synthesis. We propose a novel framework, termed as VolumeGAN, for synthesizing images under different camera views, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  article = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Qualitative Results

Code Coming Soon

BibTeX

Owner

GenForce: May Generative Force Be with You

This project aim to create multi-label classification annotation tool to boost annotation speed and make it more easier.

DeepStochlog Package For Python

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

Scalable Graph Neural Networks for Heterogeneous Graphs

AdamW optimizer for bfloat16 models in pytorch.

Spectral Temporal Graph Neural Network (StemGNN in short) for Multivariate Time-series Forecasting

Code for ACL2021 paper Consistency Regularization for Cross-Lingual Fine-Tuning.

Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?

Mixed Transformer UNet for Medical Image Segmentation

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

A Python training and inference implementation of Yolov5 helmet detection in Jetson Xavier nx and Jetson nano

Kindle is an easy model build package for PyTorch.

Replication package for the manuscript "Using Personality Detection Tools for Software Engineering Research: How Far Can We Go?" submitted to TOSEM

Another pytorch implementation of FCN (Fully Convolutional Networks)

Data manipulation and transformation for audio signal processing, powered by PyTorch

The official codes for the ICCV2021 Oral presentation "Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework"

MMdet2-based reposity about lightweight detection model: Nanodet, PicoDet.

Assginment for UofT CSC420: Intro to Image Understanding

Hand-distance-measurement-game - Hand Distance Measurement Game