Document blur detection based on Laplacian operator and text detection.

Last update: Oct 20, 2022

Overview

Document Blur Detection

For general blurred image, using the variance of Laplacian operator is a good solution. But as for the blur detection of documents, especially for document images with blurred text, text detection should be used to detect blurred text area.

This package mainly depends on opencv and paddle, to install them with requirements.txt,

pip install -r requirements

Inference model of PaddleOCR is used to detect text location. You can download the inference model with https://paddleocr.bj.bcebos.com/dygraph_v2.0/ch/ch_ppocr_mobile_v2.0_det_infer.tar. The text detection code in this project refers to the PaddleOCR project. If you want to get more information about PaddleOCR, you can go to https://github.com/PaddlePaddle/PaddleOCR to check it out.

To run main.py, use the following command.

python ./main.py --image './text_blur.jpg' --thresh_v 300 --thresh_d 0.7

If you would like to blur document images, you can run blur_ops.py to simulate motion blur and Gaussian blur. Use the following command.

python blur_ops.py --image_path './bean-license.png' --output_path './gaussian_blur.jpg' --blur_type 'gaussian blur'/'motion blur'

Some results:

Document blur detection based on Laplacian operator and text detection.

Related tags

Overview

Document Blur Detection

Owner

JoeyLr

Amazing 3D explosion animation using Pygame module.

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約

SceneCollisionNet This repo contains the code for "Object Rearrangement Using Learned Implicit Collision Functions", an ICRA 2021 paper. For more info

This pyhton script converts a pdf to Image then using tesseract as OCR engine converts Image to Text

An unofficial package help developers to implement ZATCA (Fatoora) QR code easily which required for e-invoicing

The official code for the ICCV-2021 paper "Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates".

learn how to use Gesture Control to change the volume of a computer

nofacedb/faceprocessor is a face recognition engine for NoFaceDB program complex.

Convert scans of handwritten notes to beautiful, compact PDFs

A small C++ implementation of LSTM networks, focused on OCR.

Code for paper "Role-based network embedding via structural features reconstruction with degree-regularized constraint"

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

This project proposes a camera vision based cursor control system, using hand moment captured from a webcam through a landmarks of hand by using Mideapipe module

Official implementation of Character Region Awareness for Text Detection (CRAFT)

A post-processing tool for scanned sheets of paper.

scantailor - Scan Tailor is an interactive post-processing tool for scanned pages.

Text language identification using Wikipedia data

Give a solution to recognize MaoYan font.

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

Single Shot Text Detector with Regional Attention

Document blur detection based on Laplacian operator and text detection.

Related tags

Overview

Document Blur Detection

Owner

JoeyLr

Amazing 3D explosion animation using Pygame module.

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約

SceneCollisionNet This repo contains the code for "Object Rearrangement Using Learned Implicit Collision Functions", an ICRA 2021 paper. For more info

This pyhton script converts a pdf to Image then using tesseract as OCR engine converts Image to Text

An unofficial package help developers to implement ZATCA (Fatoora) QR code easily which required for e-invoicing

The official code for the ICCV-2021 paper "Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates".

learn how to use Gesture Control to change the volume of a computer

nofacedb/faceprocessor is a face recognition engine for NoFaceDB program complex.

Convert scans of handwritten notes to beautiful, compact PDFs

A small C++ implementation of LSTM networks, focused on OCR.

Code for paper "Role-based network embedding via structural features reconstruction with degree-regularized constraint"

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

This project proposes a camera vision based cursor control system, using hand moment captured from a webcam through a landmarks of hand by using Mideapipe module

Official implementation of Character Region Awareness for Text Detection (CRAFT)

A post-processing tool for scanned sheets of paper.

scantailor - Scan Tailor is an interactive post-processing tool for scanned pages.

Text language identification using Wikipedia data

Give a solution to recognize MaoYan font.

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

Single Shot Text Detector with Regional Attention

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約