Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Last update: Dec 06, 2022

Related tags

Overview

This is the official implementation of "Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation".

For more details, please refer to our paper.

Citing the paper

Please cite the paper in your publications if it helps your research:

@inproceedings{lyu2018multi,
      title={Multi-oriented scene text detection via corner localization and region segmentation},
      author={Lyu, Pengyuan and Yao, Cong and Wu, Wenhao and Yan, Shuicheng and Bai, Xiang},
      booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      pages={7553--7563},
      year={2018}
}

Requirements
Installation
Models
Test
Train
License

Requirements

NVIDIA GPU, Ubuntu 14.04, Python2.7, CUDA8/9
PyTorch 0.2.0_3

Installation

git clone https://github.com/lvpengyuan/corner.git
sh ./make.sh   or  cd rpsroi_pooling && python build.py

Models

Download the model and place it in weights/

Our trained model: Google Drive;

Test

You can test a model in a single scale:

python eval_all.py

or in multi-scale:

python eval_multiscale.py

Note that, you should modify the model path and the test dataset before testing.

Train

python train.py

To train a new model, you should modify the training settings before training.

License

This code is only for academic purpose.

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Related tags

Overview

Citing the paper

Contents

Requirements

Installation

Models

Test

Train

License

Owner

Pengyuan Lyu

Tesseract Open Source OCR Engine (main repository)

A simple component to display annotated text in Streamlit apps.

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Hand gesture detection project with aweome UI implementation.

Semantic-based Patch Detection for Binary Programs

Recognizing cropped text in natural images.

EQFace: An implementation of EQFace: A Simple Explicit Quality Network for Face Recognition

This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog:

Document Layout Analysis

Framework for the Complete Gaze Tracking Pipeline

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

A simple document layout analysis using Python-OpenCV

Scene text recognition

Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

Virtualdragdrop - Virtual Drag and Drop Using OpenCV and Arduino

OCR powered screen-capture tool to capture information instead of images

A pure pytorch implemented ocr project including text detection and recognition

Write-ups for the SwissHackingChallenge2021 CTF.

OpenCV-Erlang/Elixir bindings