An OCR evaluation tool

Last update: Dec 20, 2022

Overview

dinglehopper

dinglehopper is an OCR evaluation tool that reads ALTO, PAGE and text files. It compares a ground truth (GT) document page with an OCR result page to compute metrics and produce a report of word/character differences.

Goals

- Useful
  - As a UI tool
  - For an automated evaluation
  - As a library
- Unicode support

Installation

It's best to use pip, e.g.:

sudo pip install .

Usage

Usage: dinglehopper [OPTIONS] GT OCR [REPORT_PREFIX]

  Compare the PAGE/ALTO/text document GT against the document OCR.

  dinglehopper detects if GT/OCR are ALTO or PAGE XML documents to extract
  their text and falls back to plain text if no ALTO or PAGE is detected.

  The files GT and OCR are usually a ground truth document and the result of
  an OCR software, but you may use dinglehopper to compare two OCR results.
  In that case, use --no-metrics to disable the then meaningless metrics and
  also change the color scheme from green/red to blue.

  The comparison report will be written to $REPORT_PREFIX.{html,json}, where
  $REPORT_PREFIX defaults to "report". The reports include the character
  error rate (CER) and the word error rate (WER).

  By default, the text of PAGE files is extracted on 'region' level. You may
  use "--textequiv-level line" to extract from the level of TextLine tags.

Options:
  --metrics / --no-metrics  Enable/disable metrics and green/red
  --textequiv-level LEVEL   PAGE TextEquiv level to extract text from
  --progress                Show progress bar
  --help                    Show this message and exit.

For example:

dinglehopper some-document.gt.page.xml some-document.ocr.alto.xml

This generates report.html and report.json.
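
The usage text above mentions the character error rate (CER) and the word error rate (WER). As a rough illustration of what these metrics measure, here is a minimal sketch that divides the Levenshtein edit distance by the length of the ground truth. It is not dinglehopper's implementation: the function names are hypothetical, and the NFC normalization, code-point comparison and whitespace word splitting are simplifying assumptions that the tool itself may handle differently.

```python
import unicodedata


def levenshtein(a, b):
    """Edit distance between two sequences (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        current = [i]
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def error_rate(gt_items, ocr_items):
    """Edit distance divided by the number of ground truth items."""
    if not gt_items:
        # Assumption: an empty ground truth gives 0 only if the OCR is empty too.
        return 0.0 if not ocr_items else float("inf")
    return levenshtein(gt_items, ocr_items) / len(gt_items)


def cer(gt, ocr):
    """Character error rate (assumption: NFC-normalized code points)."""
    gt = unicodedata.normalize("NFC", gt)
    ocr = unicodedata.normalize("NFC", ocr)
    return error_rate(gt, ocr)


def wer(gt, ocr):
    """Word error rate (assumption: words are split on whitespace)."""
    gt_words = unicodedata.normalize("NFC", gt).split()
    ocr_words = unicodedata.normalize("NFC", ocr).split()
    return error_rate(gt_words, ocr_words)
```

Under these assumptions, one wrong character in a four-character ground truth gives a CER of 0.25, and a composed "ä" compares equal to its decomposed form "a" plus a combining diaeresis.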

dinglehopper-extract

The tool dinglehopper-extract extracts the text of the given input file on stdout, for example:

dinglehopper-extract --textequiv-level line OCR-D-GT-PAGE/00000024.page.xml
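
Because dinglehopper-extract writes the text to stdout, a script can capture it for further processing. The following is a minimal sketch that assumes the tool is installed and on the PATH. The helper names `extract_command` and `extract_text` are hypothetical and not part of dinglehopper, and decoding the output as UTF-8 is an assumption.

```python
import subprocess


def extract_command(path, textequiv_level=None):
    """Build the argument list for dinglehopper-extract (hypothetical helper)."""
    command = ["dinglehopper-extract"]
    if textequiv_level is not None:
        # Only the --textequiv-level option shown in the example above is used.
        command += ["--textequiv-level", textequiv_level]
    command.append(path)
    return command


def extract_text(path, textequiv_level=None):
    """Run dinglehopper-extract and return what it prints on stdout."""
    result = subprocess.run(
        extract_command(path, textequiv_level),
        capture_output=True,
        text=True,
        encoding="utf-8",  # assumption: the tool's output is UTF-8
        check=True,        # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout
```

For instance, `extract_text("OCR-D-GT-PAGE/00000024.page.xml", "line")` would run the same command as the example above and return its output as a string.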

OCR-D

As an OCR-D processor:

ocrd-dinglehopper -I OCR-D-GT-PAGE,OCR-D-OCR-TESS -O OCR-D-OCR-TESS-EVAL

This generates HTML and JSON reports in the OCR-D-OCR-TESS-EVAL filegroup.

The OCR-D processor has these parameters:

| Parameter | Meaning |
| --- | --- |
| `-P metrics false` | Disable metrics and the green-red color scheme (default: enabled) |
| `-P textequiv_level line` | (PAGE) Extract text from TextLine level (default: TextRegion level) |

For example:

ocrd-dinglehopper -I ABBYY-FULLTEXT,OCR-D-OCR-CALAMARI -O OCR-D-OCR-COMPARE-ABBYY-CALAMARI -P metrics false

Developer information

Please refer to README-DEV.md.

Owner

QURATOR-SPK
