Detect and fix skew in images containing text

Last update: Dec 21, 2022

Overview

Alyn

Skew detection and correction in images containing text

Image with skew

Image after deskew

Install and use via pip!

Recommended way(using virtualenv):

mkdir alyn-test
cd alyn test
mkvirtualenv .
pip install alyn
source bin/activate

To detect skew angle in image:

from alyn import SkewDetect
sd = SkewDetect(
	input_file='path_to_file',
	batch_path='optional_batch_processing_path',
	output_file='optional_text_file_output_path',
	display_output='Yes/No')
sd.run()

Extra options:

sigma:canny edge detection blurring
plot_hough: display hough lines detected
num_peaks: control the number of hough line peaks

To deskew image:

from alyn import Deskew
d = Deskew(
	input_file='path_to_file',
	display_image='preview the image on screen',
	output_file='path_for_deskewed image',
	r_angle='offest_angle_in_degrees_to_control_orientation')`
d.run()

Requires

numpy
matplotlib
scipy
scikit-image

Techniques used

Canny Edge Detection
Hough Transform

Features

Detect the skew in given images
Display the output
Save the output to txt file
Batch process files in a directory
View Hough Transform of a given image
Set the number of peaks for Hough Transform and Sigma for Canny Edge detection
Rotate the image to remove the skew

How the skew detection works

The skew detection script takes image file as input, then performs the following steps:

Converts the image to greyscale
Performs Canny Edge Detection on the Image
Calculates the Hough Transform values
Determines the peaks
Determines the deviation of each peaks from 45 degree angle
Segregates the detected peaks into bins
Chooses the probable skew angle using the value in the bins

The deskew script uses the skew angle determined using skew detection script to remove the skew from the image.

Using scripts directly(older method)

Image skew calculation using skew_detect.py

To calculate the skew angle for a given image file, use -i option followed by the path to file:

./skew_detect.py -i image.jpg

To save output in a text file add -o option followed by the output file name:

./skew_detect.py -i image.jpg -o output.txt

To display output information add -d option followed by a string Yes:

./skew_detect.py -i image.jpg -d Yes

To batch process files in a directory, use -b option followed by the path to directory:

./skew_detect.py  -b examples

To display Hough Transform plot for an image,:

./skew_detect.py -i image.jpg -p Yes

Output of the Hough Transform:

To set the value of sigma for Gaussian blurring in Canny Edge Detection, use -s option followed by the desired value:

./skew_detect.py -i image.jpg -s 3

To set the number of peaks collected from Hough Transform, use -n option followed by the desired value:

./skew_detect.py -i image.jpg -n 10

Image Deskew using deskew.py

To perform a simple deskew and display the output:

./deskew.py -i image.jpg -d Yes

To save the deskewed image, use the following:

./deskew.py -i image.jpg -o rotated.jpg

In some cases the result image might be upside down or the text may be running vertical, To fix this, use -r followed by the desired angle in int:

./deskew.py -i image.jpg -o rotated.jpg -r 90

To generate data for experimental purposes, run the test_img_gen.py in test_data folder. This will generate images containing a white line having angle between 0 to 180 degrees.

Detect and fix skew in images containing text

Related tags

Overview

Alyn

Skew detection and correction in images containing text

Image with skew

Image after deskew

Install and use via pip!

To detect skew angle in image:

Extra options:

To deskew image:

Requires

Techniques used

Features

How the skew detection works

Using scripts directly(older method)

Image skew calculation using skew_detect.py

Output of the Hough Transform:

Image Deskew using deskew.py

Owner

Kakul

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

OpenCV-Erlang/Elixir bindings

Aloception is a set of package for computer vision: aloscene, alodataset, alonet.

Official code for :rocket: Unsupervised Change Detection of Extreme Events Using ML On-Board :rocket:

Python package for handwriting and sketching in Jupyter cells

This is a repository to learn and get more computer vision skills, make robotics projects integrating the computer vision as a perception tool and create a lot of awesome advanced controllers for the robots of the future.

An expandable and scalable OCR pipeline

FastOCR is a desktop application for OCR API.

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

textspotter - An End-to-End TextSpotter with Explicit Alignment and Attention

SceneCollisionNet This repo contains the code for "Object Rearrangement Using Learned Implicit Collision Functions", an ICRA 2021 paper. For more info

An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

Face Anonymizer - FaceAnonApp v1.0

Official implementation of Character Region Awareness for Text Detection (CRAFT)

Convert scans of handwritten notes to beautiful, compact PDFs

An application of high resolution GANs to dewarp images of perturbed documents

OCR, Scene-Text-Understanding, Text Recognition

Detect textlines in document images