Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Last update: Jan 07, 2023

Overview

Handwriting Recognition System

This repository is the Tensorflow implementation of the Handwriting Recognition System described in Handwriting Recognition of Historical Documents with Few Labeled Data (please cite the paper if you use this code in your research paper). This code was also used for the baseline system in Fine-tuning Handwriting Recognition systems with Temporal Dropout.

This code is free for academic and research use. For commercial use of the code please contact Edgard Chammas.

To help run the system, sample images from ICDAR2017 Competition on Handwritten Text Recognition on the READ Dataset are added.

Configuration

General configuration can be found in config.py

CNN-specific architecture configuration can be found in cnn.py

Training

python train.py

This will generate a text log file and a Tensorflow summary.

Decoding

python test.py

This will generate, for each image, the line transcription. The output will be written to decoded.txt by default.

python compute_probs.py

This will generate, for each image, the posterior probabilities at each timestep. Files will be stored in Probs by default.

Dependencies

Tensorflow
OpenCV-Python

Citation

Please cite the following paper if you use this code in your research paper:

@inproceedings{chammas2018handwriting,
  title={Handwriting Recognition of Historical Documents with few labeled data},
  author={Chammas, Edgard and Mokbel, Chafic and Likforman-Sulem, Laurence},
  booktitle={2018 13th IAPR International Workshop on Document Analysis Systems (DAS)},
  pages={43--48},
  year={2018},
  organization={IEEE}
}

Acknowledgment

We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.

Contributions

Feel free to send your pull request or open issues.

Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Related tags

Overview

Handwriting Recognition System

Configuration

Training

Decoding

Dependencies

Citation

Acknowledgment

Contributions

Owner

Edgard Chammas

Repository for Scene Text Detection with Supervised Pyramid Context Network with tensorflow.

Deep Learning Chinese Word Segment

MONAI Label is a server-client system that facilitates interactive medical image annotation by using AI.

Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)

Tesseract Open Source OCR Engine (main repository)

Detect textlines in document images

list all open dataset about ocr.

https://arxiv.org/abs/1904.01941

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Text recognition (optical character recognition) with deep learning methods.

Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrieval.

Here use convulation with sobel filter from scratch in opencv python .

This repository contains codes on how to handle mouse event using OpenCV

Document blur detection based on Laplacian operator and text detection.

APS 6º Semestre - UNIP (2021)

The papers published in top-tier AI conferences in recent years.

The world's simplest facial recognition api for Python and the command line

This is a project to detect gestures to zoom in or out, using the real-time distance between the index finger and the thumb. It's based on OpenCV and Mediapipe.

OCR engine for all the languages

Demo for the paper "Overlap-aware low-latency online speaker diarization based on end-to-end local segmentation"