Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

Last update: Jan 01, 2023

Overview

ImageProcessingTransformer

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

The latest version contains some important modifications according to the official mindspore implementation. It makes convergecy a lot faster. Please make sure you update to the latest version.

only contain model definition file and train/test file. Dataloader file is not yet released. You could implement your own dataloader. It may be released in the next version.

To pretrain on random task

python main.py --seed 0 \
--lr 5e-5 \
--save-path "./ckpt" \
--epochs 300 \
--data path-to-data \
--batch-size 256

To finetune on a specific task

python main.py --seed 0 \
--lr 2e-5 \
--save-path "./ckpt" \
--epochs 30 \
--reset-epoch \
--data path-to-data \
--batch-size 256 \
--resume path-to-pretrain-model \
--task "dehaze"

To eval on a specific task

python main.py --seed 0 \
--eval-data path-to-val-data \
--batch-size 256 \
--eval \
--resume path-to-model \
--task "dehaze"

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

Related tags

Overview

ImageProcessingTransformer

Owner

A tensorflow implementation of an HMM layer

NeRD: Neural Reflectance Decomposition from Image Collections

Fast, accurate and reliable software for algebraic CT reconstruction

Transport Mode detection - can detect the mode of transport with the help of features such as acceeration,jerk etc

Benchmark for evaluating open-ended generation

BC3407-Group-5-Project - BC3407 Group Project With Python

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021

AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty

Deep Learning for Natural Language Processing SS 2021 (TU Darmstadt)

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

免费获取http代理并生成proxifier配置文件

Deep Face Recognition in PyTorch

An Efficient Training Approach for Very Large Scale Face Recognition or F²C for simplicity.

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

TrackTech: Real-time tracking of subjects and objects on multiple cameras

PyTorch code for 'Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning'

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

ViDT: An Efficient and Effective Fully Transformer-based Object Detector

From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement (CVPR'2020)

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

Related tags

Overview

ImageProcessingTransformer

Owner

A tensorflow implementation of an HMM layer

NeRD: Neural Reflectance Decomposition from Image Collections

Fast, accurate and reliable software for algebraic CT reconstruction

Transport Mode detection - can detect the mode of transport with the help of features such as acceeration,jerk etc

Benchmark for evaluating open-ended generation

BC3407-Group-5-Project - BC3407 Group Project With Python

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021

AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty

Deep Learning for Natural Language Processing SS 2021 (TU Darmstadt)

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

免费获取http代理并生成proxifier配置文件

Deep Face Recognition in PyTorch

An Efficient Training Approach for Very Large Scale Face Recognition or F²C for simplicity.

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

TrackTech: Real-time tracking of subjects and objects on multiple cameras

PyTorch code for 'Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning'

Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)

ViDT: An Efficient and Effective Fully Transformer-based Object Detector

From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement (CVPR'2020)

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,