Code for Text Prior Guided Scene Text Image Super-Resolution

Last update: Dec 26, 2022

Overview

Text Prior Guided Scene Text Image Super-Resolution

https://arxiv.org/abs/2106.15368

Jianqi Ma, Shi Guo, Lei Zhang
Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China

Recovering TextZoom samples

Environment:

Other Python packages that may be required: pyyaml, cv2 (OpenCV), Pillow, and imgaug.

Main idea

Single stage with loss

Multi-stage version

Configure your training

Download the pretrained recognizer from:

Aster: https://github.com/ayumiymk/aster.pytorch  
MORAN: https://github.com/Canjie-Luo/MORAN_v2
CRNN: https://github.com/meijieru/crnn.pytorch

Unzip the code, change into '$TPGSR_ROOT$/', and place the pretrained recognizer weights in '$TPGSR_ROOT$/'.
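
Before training, it can help to confirm that the recognizer weights actually landed in the root directory. The following is a minimal sketch, not part of the repository: the helper name `find_checkpoints` and the list of file extensions are assumptions, since the weight file names are not given here.

```python
from pathlib import Path

# Assumed extensions for PyTorch checkpoint files; adjust to the files you downloaded.
CHECKPOINT_SUFFIXES = (".pth", ".pth.tar", ".tar")


def find_checkpoints(root):
    """Return the sorted names of files in `root` that look like checkpoints."""
    root_path = Path(root)
    return sorted(
        p.name
        for p in root_path.iterdir()
        if p.is_file() and p.name.endswith(CHECKPOINT_SUFFIXES)
    )


if __name__ == "__main__":
    # "." stands in for $TPGSR_ROOT$ when the script is run from that directory.
    print(find_checkpoints("."))
```

An empty list suggests the weights are missing or stored under a different extension.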

Download the TextZoom dataset:

https://github.com/JasonBoy1/TextZoom

Train the corresponding model (e.g. TPGSR-TSRN):

chmod a+x train_TPGSR-TSRN.sh
./train_TPGSR-TSRN.sh
or
# --arch         The architecture
# --batch_size   The batch size
# --STN          Using STN net for alignment
# --mask         Using the contour mask
# --use_distill  Using the TP loss
# --gradient     Using the Gradient Prior Loss
# --sr_share     Sharing weights for SR Module
# --stu_iter     The number of iterations in multi-stage version
# --vis_dir      The checkpoint directory
python3 main.py --arch="tsrn_tl_cascade" \
                --batch_size=48 \
                --STN \
                --mask \
                --use_distill \
                --gradient \
                --sr_share \
                --stu_iter=1 \
                --vis_dir='vis_TPGSR-TSRN'
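
To make the flag types explicit, here is a hedged sketch of how a command line like the one above could be parsed with `argparse`. It is an illustration only: the function name `build_parser`, the defaults, and the help strings are assumptions, and the repository's actual `main.py` may define these options differently.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags of the training command."""
    parser = argparse.ArgumentParser(description="Illustrative TPGSR training flags")
    # Options that take a value (defaults here are assumptions).
    parser.add_argument("--arch", type=str, default="tsrn_tl_cascade",
                        help="the architecture")
    parser.add_argument("--batch_size", type=int, default=48,
                        help="the batch size")
    parser.add_argument("--stu_iter", type=int, default=1,
                        help="number of iterations in the multi-stage version")
    parser.add_argument("--vis_dir", type=str, default="vis_TPGSR-TSRN",
                        help="the checkpoint directory")
    # Boolean switches: present means enabled, absent means disabled.
    parser.add_argument("--STN", action="store_true",
                        help="use the STN net for alignment")
    parser.add_argument("--mask", action="store_true",
                        help="use the contour mask")
    parser.add_argument("--use_distill", action="store_true",
                        help="use the TP loss")
    parser.add_argument("--gradient", action="store_true",
                        help="use the Gradient Prior Loss")
    parser.add_argument("--sr_share", action="store_true",
                        help="share weights for the SR module")
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```

In this sketch, the switches `--STN`, `--mask`, `--use_distill`, `--gradient` and `--sr_share` carry no value, while `--batch_size` and `--stu_iter` are parsed as integers.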

Run the shell script with the "test" prefix to test the corresponding model.

Add '--go_test' to the command in the shell file.
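
As a rough illustration of how a switch such as '--go_test' could select between training and testing, consider the sketch below. The function `select_mode` is hypothetical and is not taken from the repository; it only shows the common pattern of a boolean flag choosing the run mode while all other arguments are passed through untouched.

```python
import argparse


def select_mode(argv):
    """Return "test" if --go_test is present in argv, otherwise "train"."""
    parser = argparse.ArgumentParser(add_help=False)
    parser.add_argument("--go_test", action="store_true")
    # parse_known_args ignores the other training flags instead of rejecting them.
    args, _unknown = parser.parse_known_args(argv)
    return "test" if args.go_test else "train"


if __name__ == "__main__":
    print(select_mode(["--arch=tsrn_tl_cascade", "--STN", "--go_test"]))
```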

Cite this paper:

@article{ma2021text,
title={Text Prior Guided Scene Text Image Super-resolution},
author={Ma, Jianqi and Guo, Shi and Zhang, Lei},
journal={arXiv preprint arXiv:2106.15368},
year={2021}
}
