Machine Learning to Denoise Images for Better OCR Accuracy

This project is an adaptation of this tutorial and used only for learning purposes: https://www.pyimagesearch.com/2021/10/20/using-machine-learning-to-denoise-images-for-better-ocr-accuracy/#download-the-code

Setting Up the project 🚀

First and foremost clone the project with:

$ git clone https://github.com/AntonioBriPerez/Ocr-Denoiser

You don't need to extract the zip files in order to train the model.

Once you have cloned the repository you will need to extract the features from the noisy images. This script will extract 5 x 5 - 25-d feature vectors and the it will extract the target (or cleaned) pixel value from the correspondiente ground truth standard image. And then, this features will be saved in a csv file (~200MB). To extract this features you will have to execute:

$ python3 build_features.py

It will generate the following output:

Once you have done that we will have to load those features in a proper split to train our Random Forest Regressor. That code is implemented in the file train_denoiser.py. To train the model you will have to run the command:

$ python train_denoiser.py

And it will generate:

To check that the model performs good you can execute:

$ python3 denoise_document.py --testing denoising-dirty-documents/test

And some images will be written in disk so you can check the original image and the image obtained by the model we just have trained.

Any doubts or suggestions please open an issue.

Machine Leaning applied to denoise images to improve OCR Accuracy

Related tags

Overview

Machine Learning to Denoise Images for Better OCR Accuracy

Setting Up the project 🚀

Owner

Antonio Bri Pérez

Apply different text recognition services to images of handwritten documents.

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Image processing using OpenCv

Text layer for bio-image annotation.

Repositório para registro de estudo da biblioteca opencv (Python)

A set of workflows for corpus building through OCR, post-correction and normalisation

ocroseg - This is a deep learning model for page layout analysis / segmentation.

Text recognition (optical character recognition) with deep learning methods.

docstrum

M-LSDを用いて四角形を検出し、射影変換を行うサンプルプログラム

Controlling the computer volume with your hands // OpenCV

Handwritten Text Recognition (HTR) system implemented with TensorFlow (TF) and trained on the IAM off-line HTR dataset. This Neural Network (NN) model recognizes the text contained in the images of segmented words.

CTPN + DenseNet + CTC based end-to-end Chinese OCR implemented using tensorflow and keras

TextBoxes++: A Single-Shot Oriented Scene Text Detector

Lightning Fast Language Prediction 🚀

DouZero is a reinforcement learning framework for DouDizhu - 斗地主AI

Pixie - A full-featured 2D graphics library for Python

Code related to "Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity" paper

A fastai/PyTorch package for unpaired image-to-image translation.

Creating a virtual tv using opencv in python3.