A pure pytorch implemented ocr project including text detection and recognition

Last update: Dec 30, 2022

Overview

ocr.pytorch

A pure pytorch implemented ocr project.
Text detection is based CTPN and text recognition is based CRNN.
More detection and recognition methods will be supported!

Prerequisite

python-3.5+
pytorch-0.4.1+
torchvision-0.2.1
opencv-3.4.0.14
numpy-1.14.3

They could all be installed through pip except pytorch and torchvision. As for pytorch and torchvision, they both depends on your CUDA version, you would prefer to reading pytorch's official site

Detection

Detection is based on CTPN, some codes are borrowed from pytorch_ctpn, several detection results:

Recognition

Recognition is based on CRNN, some codes are borrowed from crnn.pytorch

Test

Download pretrained models from Baidu Netdisk (extract code: u2ff) or Google Driver and put these files into checkpoints. Then run

python3 demo.py

The image files in ./test_images will be tested for text detection and recognition, the results will be stored in ./test_result.

If you want to test a single image, run

python3 test_one.py [filename]

Train

Training codes are placed into train_code directory.
Train CTPN
Train CRNN

Licence

MIT License

A pure pytorch implemented ocr project including text detection and recognition

Related tags

Overview

ocr.pytorch

Prerequisite

Detection

Recognition

Test

Train

Licence

Owner

coura

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

Perspective recovery of text using transformed ellipses

Image Detector and Convertor App created using python's Pillow, OpenCV, cvlib, numpy and streamlit packages.

An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".

Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE

Dirty, ugly, and hopefully useful OCR of Facebook Papers docs released by Gizmodo

Detect handwritten words in a text-line (classic image processing method).

Handwritten Character Recognition using CNN

Isearch (OSINT) 🔎 Face recognition reverse image search on Instagram profile feed photos.

Convert scans of handwritten notes to beautiful, compact PDFs

A simple component to display annotated text in Streamlit apps.

The code of "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes"

Regions sanitàries (RS), Sectors Sanitàris (SS) i Àrees Bàsiques de Salut (ABS) de Catalunya

This repository provides train＆test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking.

A Python script to capture images from multiple webcams at once and save them into your local machine

Optical character recognition for Japanese text, with the main focus being Japanese manga

Textboxes_plusplus implementation with Tensorflow (python)

Run tesseract with the tesserocr bindings with @OCR-D's interfaces

"Very simple but works well" Computer Vision based ID verification solution provided by LibraX.

PAGE XML format collection for document image page content and more