VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at high-fidelity 3D-aware image synthesis. We propose a novel framework, termed VolumeGAN, for synthesizing images under different camera views by explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. This design enables independent control of shape and appearance. Extensive experiments on a wide range of datasets show that our approach achieves higher image quality and better 3D control than previous methods.
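
To make the "accumulated into a 2D feature map" step more concrete, here is a minimal sketch of how per-sample features along a single camera ray could be composited into one feature vector for one pixel. It assumes the standard NeRF-style volume-rendering quadrature; the exact formulation used by VolumeGAN is not given on this page, and the function name `accumulate_features` and the toy inputs are hypothetical. The feature volume, the NeRF-like model that predicts densities and features, and the neural renderer are all left out.

```python
import math


def accumulate_features(densities, features, deltas):
    """Composite per-sample features along one ray into a single feature vector.

    Assumes standard NeRF-style quadrature (an assumption, not taken from the paper):
      densities: non-negative density at each sample along the ray
      features:  feature vector at each sample (all the same length)
      deltas:    spacing between adjacent samples along the ray
    """
    dim = len(features[0])
    pixel = [0.0] * dim
    transmittance = 1.0  # fraction of the ray not yet absorbed
    weights = []
    for sigma, feat, delta in zip(densities, features, deltas):
        alpha = 1.0 - math.exp(-sigma * delta)  # opacity of this segment
        weight = transmittance * alpha          # contribution of this sample
        weights.append(weight)
        for k in range(dim):
            pixel[k] += weight * feat[k]
        transmittance *= 1.0 - alpha            # attenuate for samples behind
    return pixel, weights


# Toy ray: empty space, then a dense surface, then a sample hidden behind it.
densities = [0.0, 50.0, 50.0]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
deltas = [0.1, 0.1, 0.1]
pixel, weights = accumulate_features(densities, features, deltas)
```

In this toy ray the empty sample contributes nothing, and the first dense sample receives almost all of the weight, so the sample behind it is largely occluded. Repeating the accumulation for every pixel's ray would give the 2D feature map that the neural renderer consumes.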

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  journal = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

Owner

GenForce: May Generative Force Be with You
