This is a JAX implementation of Neural Radiance Fields for learning purposes.

Last update: Dec 20, 2022

Related tags

Overview

learn-nerf

This is a JAX implementation of Neural Radiance Fields for learning purposes.

I've been curious about NeRF and its follow-up work for a while, but don't have much time to explore it. I learn best by doing, so I'll be implementing stuff here to try to get a feel for it.

Usage

The steps to using this codebase are as follows:

Generate a dataset - run a simple Go program to turn any .stl 3D model into a series of rendered camera views with associated metadata.
Train a model - install the Python dependencies and run the training script.
Render a novel view - render a novel view of the object using a model.

Generating a dataset

I use a simple format for storing rendered views of the scene. Each frame is stored as a PNG file, and each PNG has an accompanying JSON file describing the camera view.

For easy experimentation, I created a Go program to render an arbitrary .stl file as a collection of views in the supported data format. To run this program, install Go and run go get . inside of simple_dataset/ to get the dependencies. Next, run

$ go run . /path/to/model.stl data_dir

This will create a directory data_dir containing rendered views of /path/to/model.stl.

Training a model

First, install the learn_nerf package by running pip install -e . inside this repository. You should separately make sure jax and Flax are installed in your environment.

The training script is learn_nerf/scripts/train_nerf.py. Here's an example of running this script:

python learn_nerf/scripts/train_nerf.py \
    --lr 1e-5 \
    --batch_size 1024 \
    --save_path model_weights.pkl \
    /path/to/data_dir

This will periodically save model weights to model_weights.pkl. The script may get stuck on training... while it shuffles the dataset and compiles the training graph. Wait a minute or two, and losses should start printing out as training ramps up.

If you get a Segmentation fault on CPU, this may be because you don't have enough memory to run batch size 1024--try something lower.

Render a novel view

To render a view from a trained NeRF model, use learn_nerf/scripts/render_nerf.py. Here's an example of the usage:

python learn_nerf/scripts/render_nerf.py \
    --batch_size 1024 \
    --model_path model_weights.pkl \
    --width 128 \
    --height 128 \
    /path/to/data_dir/0000.json \
    output.png

In the above example, we will render the camera view described by /path/to/data_dir/0000.json. Note that the camera view can be from the training set, but doesn't need to be as long as its in the correct JSON format.

This is a JAX implementation of Neural Radiance Fields for learning purposes.

Related tags

Overview

learn-nerf

Usage

Generating a dataset

Training a model

Render a novel view

Owner

Alex Nichol

PyTorch Implementation of Vector Quantized Variational AutoEncoders.

Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network)

Auto White-Balance Correction for Mixed-Illuminant Scenes

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

Image-Scaling Attacks and Defenses

Hand gesture recognition model that can be used as a remote control for a smart tv.

[ACM MM 2021] Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation)

Weighted K Nearest Neighbors (kNN) algorithm implemented on python from scratch.

A new GCN model for Point Cloud Analyse

SCALE: Modeling Clothed Humans with a Surface Codec of Articulated Local Elements (CVPR 2021)

Adversarial Self-Defense for Cycle-Consistent GANs

Semi-Supervised Learning with Ladder Networks in Keras. Get 98% test accuracy on MNIST with just 100 labeled examples !

This is an official implementation for "PlaneRecNet".

Python calculations for the position of the sun and moon.

Spearmint Bayesian optimization codebase

Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition

RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and related datasets

Mahadi-Now - This Is Pakistani Just Now Login Tools

[AAAI 2021] EMLight: Lighting Estimation via Spherical Distribution Approximation and [ICCV 2021] Sparse Needlets for Lighting Estimation with Spherical Transport Loss