This repository provides training and testing code, the dataset, detection and recognition annotations, an evaluation script, an annotation tool, and a ranking.

Last update: Dec 18, 2022

Overview

SCUT-CTW1500 Datasets

We have updated the annotations for both the training and test sets.

Train: 1000 images [images][annos]

Additional point annotations for each character are included. An example can be found here.

wget -O train_images.zip https://universityofadelaide.box.com/shared/static/py5uwlfyyytbb2pxzq9czvu6fuqbjdh8.zip
wget -O train_labels.zip https://universityofadelaide.box.com/shared/static/jikuazluzyj4lq6umzei7m2ppmt3afyw.zip

Test: 500 images [images][annos]

wget -O test_images.zip https://universityofadelaide.box.com/shared/static/t4w48ofnqkdw7jyc4t11nsukoeqk9c3d.zip
wget -O test_labels.zip https://cloudstor.aarnet.edu.au/plus/s/uoeFl0pCN9BOCN5/download
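
Once the four archives above have been downloaded, they can be unpacked with a few lines of Python. This is a minimal sketch, not part of the repository: the function name `extract_archives` and the output folder `ctw1500` are assumptions chosen for illustration, and the internal layout of each archive is not described here.

```python
import zipfile
from pathlib import Path

# Archive names match the -O arguments of the wget commands above.
ARCHIVES = [
    "train_images.zip",
    "train_labels.zip",
    "test_images.zip",
    "test_labels.zip",
]


def extract_archives(archive_dir=".", out_dir="ctw1500"):
    """Unpack each downloaded archive into out_dir/<archive stem>/.

    Hypothetical helper: the output layout is an assumption, not the
    layout expected by the repository's scripts.
    Returns the list of folders that were written to.
    """
    extracted = []
    for name in ARCHIVES:
        archive = Path(archive_dir) / name
        if not archive.is_file():
            # Skip archives that have not been downloaded yet.
            continue
        target = Path(out_dir) / archive.stem
        target.mkdir(parents=True, exist_ok=True)
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
        extracted.append(target)
    return extracted
```

Calling `extract_archives()` from the download folder would place each archive's contents under `ctw1500/train_images`, `ctw1500/train_labels`, and so on; adjust the paths to whatever your training code expects.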

Note that all Chinese text is annotated with '###' (ignore) in this updated version, because there are too few Chinese instances for either training or testing. ArT and LSVT are two optional large-scale arbitrarily-shaped text benchmarks that cover both Chinese and English.
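
The '###' convention can be handled by separating ignored regions from the ones that count. The sketch below assumes each annotation has already been parsed into a dictionary with hypothetical `points` and `text` keys; this in-memory structure and the function name `split_ignored` are illustrative assumptions, not the format of the annotation files.

```python
IGNORE_TAG = "###"  # transcription that marks an ignored region


def split_ignored(annotations):
    """Split annotations into (kept, ignored) by transcription.

    Each annotation is assumed to be a dict with a "text" key; this
    structure is hypothetical, not the on-disk annotation format.
    """
    kept, ignored = [], []
    for ann in annotations:
        if ann["text"] == IGNORE_TAG:
            ignored.append(ann)
        else:
            kept.append(ann)
    return kept, ignored


# Made-up sample data for illustration only.
sample = [
    {"points": [(0, 0), (10, 0), (10, 5), (0, 5)], "text": "EXIT"},
    {"points": [(20, 0), (30, 0), (30, 5), (20, 5)], "text": "###"},
]
kept, ignored = split_ignored(sample)
print(len(kept), len(ignored))
```

Returning the ignored regions instead of dropping them leaves room for an evaluation that treats them as "don't care" areas, as ICDAR-style protocols commonly do.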

SCUT-CTW1500 Evaluation

Original detection-only evaluation script.

For both detection and end-to-end evaluation in the updated version, please refer to here. This script also includes an evaluation example for Total-text.

Info

This project is outdated and is no longer maintained. The original information is kept in OLD_README.md.

Copyright

The SCUT-CTW1500 database is free to the academic community for research only.

For other purposes, please contact Dr. Lianwen Jin: [email protected]

Owner

Yuliang Liu
