PyTorch Implementation of "Light Field Image Super-Resolution with Transformers"

Last update: Nov 28, 2022

Related tags

Deep Learning LFT

Overview

LFT

PyTorch implementation of "Light Field Image Super-Resolution with Transformers", arXiv 2021. [pdf].

Contributions:

We make the first attempt to adapt Transformers to LF image processing, and propose a Transformer-based network for LF image SR.
We propose a novel paradigm (i.e., angular and spatial Transformers) to incorporate angular and spatial information in an LF.
With a small model size and low computational cost, our LFT achieves superior SR performance than other state-of-the-art methods.

Codes and Models:

Requirement

PyTorch 1.3.0, torchvision 0.4.1. The code is tested with python=3.6, cuda=9.0.
Matlab (For training/test data generation and performance evaluation)

Datasets

We used the EPFL, HCInew, HCIold, INRIA and STFgantry datasets for both training and test. Please first download our dataset via Baidu Drive (key:7nzy) or OneDrive, and place the 5 datasets to the folder ./datasets/.

Train

Run Generate_Data_for_Training.m to generate training data. The generated data will be saved in ./data_for_train/ (SR_5x5_2x, SR_5x5_4x).

Run train.py to perform network training. Example for training LFT on 5x5 angular resolution for 4x/2xSR:

$ python train.py --model_name LFT --angRes 5 --scale_factor 4 --batch_size 4
$ python train.py --model_name LFT --angRes 5 --scale_factor 2 --batch_size 8

Checkpoint will be saved to ./log/.

Test

Run Generate_Data_for_Test.m to generate test data. The generated data will be saved in ./data_for_test/ (SR_5x5_2x, SR_5x5_4x).

Run test.py to perform network inference. Example for test LFT on 5x5 angular resolution for 4x/2xSR:

python test.py --model_name LFT --angRes 5 --scale_factor 4 \ 
--use_pre_pth True --path_pre_pth './pth/LFT_5x5_4x_epoch_50_model.pth

python test.py --model_name LFT --angRes 5 --scale_factor 2 \ 
--use_pre_pth True --path_pre_pth './pth/LFT_5x5_2x_epoch_50_model.pth

The PSNR and SSIM values of each dataset will be saved to ./log/.

Results:

Quantitative Results

Efficiency

Visual Comparisons

Angular Consistency

Spatial-Aware Angular Modeling

Citiation

If you find this work helpful, please consider citing:

@Article{LFT,
    author    = {Liang, Zhengyu and Wang, Yingqian and Wang, Longguang and Yang, Jungang and Zhou, Shilin},
    title     = {Light Field Image Super-Resolution with Transformers},
    journal   = {arXiv preprint},
    month     = {August},
    year      = {2021},   
}

Contact

Any question regarding this work can be addressed to [email protected].

PyTorch Implementation of "Light Field Image Super-Resolution with Transformers"

Related tags

Overview

LFT

PyTorch implementation of "Light Field Image Super-Resolution with Transformers", arXiv 2021. [pdf].

Contributions:

Codes and Models:

Requirement

Datasets

Train

Test

Results:

Citiation

Contact

Owner

Squidward

[CVPR 2021] Official PyTorch Implementation for "Iterative Filter Adaptive Network for Single Image Defocus Deblurring"

An open source bike computer based on Raspberry Pi Zero (W, WH) with GPS and ANT+. Including offline map and navigation.

Python package provinding tools for artistic interactive applications using AI

Official implementation of the paper Chunked Autoregressive GAN for Conditional Waveform Synthesis

Text and code for the forthcoming second edition of Think Bayes, by Allen Downey.

Pytorch implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Investigating Attention Mechanism in 3D Point Cloud Object Detection (arXiv 2021)

The code from the paper Character Transformations for Non-Autoregressive GEC Tagging

Cross View SLAM

Planar Prior Assisted PatchMatch Multi-View Stereo

This is the code repository for the paper "Identification of the Generalized Condorcet Winner in Multi-dueling Bandits" (NeurIPS 2021).

TensorFlow Metal Backend on Apple Silicon Experiments (just for fun)

we propose EfficientDerain for high-efficiency single-image deraining

codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"

DeepStochlog Package For Python

Tackling the Class Imbalance Problem of Deep Learning Based Head and Neck Organ Segmentation

Focal and Global Knowledge Distillation for Detectors

Gems & Holiday Package Prediction

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

This repo is developed for Strong Baseline For Vehicle Re-Identification in Track 2 Ai-City-2021 Challenges