Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

Last update: Aug 07, 2022

Related tags

Deep Learning NonLatinPhotoOCR

Overview

Convolutional Recurrent Neural Network + CTCLoss | STAR-Net

Code for paper "Towards Boosting the Accuracy of Non-Latin Scene Text Recognition"

Dependencies

Python 3.6.5
torch==1.2.0
torchvision==0.4.0
tensorboard==2.3.0

How to run the code?

Prepare data

Follow the instructions in meijieru/crnn.pytorch to create the LMDB datasets. Use the same steps for both the training and the validation data.
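As a rough illustration of what such a dataset holds, the sketch below builds the key/value pairs for a list of samples. The key layout (image-%09d, label-%09d, num-samples, with indices starting at 1) is an assumption based on the dataset creation script in meijieru/crnn.pytorch, so check it against that repository before relying on it. The names build_cache and write_cache are hypothetical helpers, not part of this code base.

```python
def build_cache(samples):
    """Build LMDB key/value pairs from (image_bytes, label) tuples.

    Assumed key layout: image-%09d, label-%09d and num-samples,
    with sample indices starting at 1.
    """
    cache = {}
    count = 0
    for image_bytes, label in samples:
        count += 1
        cache[("image-%09d" % count).encode("utf-8")] = image_bytes
        cache[("label-%09d" % count).encode("utf-8")] = label.encode("utf-8")
    cache[b"num-samples"] = str(count).encode("utf-8")
    return cache


def write_cache(output_path, cache, map_size=1 << 30):
    """Write the pairs to an LMDB environment (needs the lmdb package).

    map_size is an illustrative upper bound on the database size in bytes.
    """
    import lmdb  # imported here so build_cache works without lmdb installed

    env = lmdb.open(output_path, map_size=map_size)
    with env.begin(write=True) as txn:
        for key, value in cache.items():
            txn.put(key, value)
    env.close()


# Two dummy samples; real image_bytes would be the raw contents of image files.
example = build_cache([(b"raw-image-1", "नमस्ते"), (b"raw-image-2", "भारत")])
print(sorted(example.keys()))
```

Calling write_cache once for the training samples and once for the validation samples would give the two directories passed as --trainRoot and --valRoot.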

Change parameters and alphabets

Update the parameters and the character set to match your data.

Change parameters in the mytrain.py file
Change alphabets

Put every character that appears in your labels in a file and pass that file to mytrain.py as --charlist; otherwise the program will throw an error during training.
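One way to build such a file is to collect the unique characters from your labels, as in the sketch below. It is only an illustration: the helper names are hypothetical, the one-character-per-line layout is an assumption (check how mytrain.py reads the charlist), and the code treats each Unicode code point, including combining marks such as Devanagari vowel signs, as a separate character.

```python
import tempfile
from pathlib import Path


def build_charlist(labels):
    """Return the sorted unique code points that occur in the labels."""
    chars = set()
    for label in labels:
        chars.update(label)
    return sorted(chars)


def write_charlist(chars, path):
    """Write one character per line (assumed layout, not confirmed by the repo)."""
    Path(path).write_text("\n".join(chars) + "\n", encoding="utf-8")


def missing_characters(labels, charlist):
    """Return the characters used in labels that the charlist does not cover."""
    known = set(charlist)
    return sorted({ch for label in labels for ch in label if ch not in known})


# Build the list from training labels, save it, and read it back.
train_labels = ["नमस्ते", "भारत"]
chars = build_charlist(train_labels)
with tempfile.TemporaryDirectory() as tmp:
    lexicon_path = Path(tmp) / "lexicon.txt"
    write_charlist(chars, lexicon_path)
    reloaded = lexicon_path.read_text(encoding="utf-8").splitlines()

# Check validation labels against the saved list before starting training.
print(missing_characters(["नमक"], reloaded))
```

Running missing_characters over both the training and validation labels before training is a cheap way to catch the error described above.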

Train

Run mytrain.py, replacing the example paths with your own:

python3 mytrain.py --trainRoot /ssd_scratch/cvit/sanjana/hindi-train-lmdb \
--valRoot /ssd_scratch/cvit/sanjana/hindi-test-lmdb \
--arch crnn --lan hindi --charlist /ssd_scratch/cvit/sanjana/crnn_new/lexicon.txt \
--batchSize 32 --nepoch 15 --cuda --expr_dir /ssd_scratch/cvit/sanjana \
--displayInterval 10 --valInterval 100 --adadelta \
--manualSeed 1234 --random_sample --deal_with_lossnan

Reference

meijieru/crnn.pytorch
Sierkinhane/crnn_chinese_characters_rec

If you use the dataset or code from this work, please cite:

@inproceedings{gunnaNonLatin2021,
  title={Towards {B}oosting the {A}ccuracy of {N}on-{L}atin {S}cene {T}ext {R}ecognition},
  author={Sanjana Gunna and Rohit Saluja and C V Jawahar},
  booktitle={2021 International Conference on Document Analysis and Recognition Workshops (ICDARW)},
  year={2021},
  organization={IEEE}
}

Owner

Sanjana Gunna
