End-to-end pipeline for real-time scene text detection and recognition.

Last update: Aug 04, 2022

Overview

Real-time-Scene-Text-Detection-and-Recognition-System

End-to-end pipeline for real-time scene text detection and recognition.

The detection model use the EAST, the recognition model use the crnn.

Download trained EAST model from https://drive.google.com/file/d/0B3APw5BZJ67ETHNPaU9xUkVoV0U/view and put it in EAST/result.

Download trained crnn model from https://www.dropbox.com/s/dboqjk20qjkpta3/crnn.pth?dl=0 and put it in crnn/samples.

Screenshot

Video

Here is the full video of the real time system: https://youtu.be/1zvteqaU4h4

Owner

Fangneng Zhan

Computer Vision, Deep Learning.

GitHub Repository

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

English | 简体中文 Introduction PaddleOCR aims to create multilingual, awesome, leading, and practical OCR tools that help users train better models and a

27.5k Jan 08, 2023

基于Paddle框架的PSENet复现

PSENet-Paddle 基于Paddle框架的PSENet复现本项目基于paddlepaddle框架复现PSENet，并参加百度第三届论文复现赛，将在2021年5月15日比赛完后提供AIStudio链接～敬请期待 AIStudio链接参考项目： whai362-PSENet 环境配置本项目

4 Apr 24, 2022

Computer vision applications project (Flask and OpenCV)

Computer Vision Applications Project This project is at it's initial phase. This is all about the implementation of different computer vision techniqu

1 Jan 26, 2022

ARU-Net - Deep Learning Chinese Word Segment

ARU-Net: A Neural Pixel Labeler for Layout Analysis of Historical Documents Contents Introduction Installation Demo Training Introduction This is the

128 Sep 12, 2022

Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)

Open Semantic Search https://opensemanticsearch.org Integrated search server, ETL framework for document processing (crawling, text extraction, text a

684 Jan 06, 2023

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels"

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels" Please refer to htt

1 Feb 14, 2022

Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"

Bailando Code for CVPR 2022 (oral) paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory" [Paper] | [Project Page] | [Vi

237 Dec 29, 2022

Face_mosaic - Mosaic blur processing is applied to multiple faces appearing in the video

動機 face_recognitionを使用して得られる顔座標は長方形であり、この座標をそのまま用いてぼかし処理を行った場合得られる画像は醜い。それに対してモ

6 Feb 03, 2022

Document blur detection based on Laplacian operator and text detection.

Document Blur Detection For general blurred image, using the variance of Laplacian operator is a good solution. But as for the blur detection of docum

5 Oct 20, 2022

Converts an image into funny, smaller amongus characters

SussyImage Converts an image into funny, smaller amongus characters Demo Mona Lisa | Lona Misa (Made up of AmongUs characters) API I've also added an

14 Aug 18, 2022

Pytorch implementation of PSEnet with Pyramid Attention Network as feature extractor

Scene Text-Spotting based on PSEnet+CRNN Pytorch implementation of an end to end Text-Spotter with a PSEnet text detector and CRNN text recognizer. We

62 Oct 10, 2022

A Python wrapper for the tesseract-ocr API

tesserocr A simple, Pillow-friendly, wrapper around the tesseract-ocr API for Optical Character Recognition (OCR). tesserocr integrates directly with

1.7k Dec 31, 2022

Um RPG de texto orientado a objetos.

RPG de texto Um RPG de texto orientado a objetos, sem história. Um RPG (Role-playing game) baseado em texto em que você pode viajar para alguns locais

3 Oct 05, 2022

Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition

CRNN_Tensorflow This is a TensorFlow implementation of a Deep Neural Network for scene text recognition. It is mainly based on the paper "An End-to-En

1000 Dec 27, 2022

Comparison-of-OCR (KerasOCR, PyTesseract,EasyOCR)

Optical Character Recognition OCR (Optical Character Recognition) is a technology that enables the conversion of document types such as scanned paper

21 Dec 25, 2022

Camelot: PDF Table Extraction for Humans

Camelot: PDF Table Extraction for Humans Camelot is a Python library that makes it easy for anyone to extract tables from PDF files! Note: You can als

3.3k Dec 31, 2022

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

OCR.space OCR Result Checker = Draw OCR overlay on top of image Python tool that takes the OCR.space JSON output as input, and draws an overlay on to

4 Oct 18, 2022

Ocular is a state-of-the-art historical OCR system.

Ocular Ocular is a state-of-the-art historical OCR system. Its primary features are: Unsupervised learning of unknown fonts: requires only document im

228 Dec 30, 2022

An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments

Tips: A more recent scene text detection algorithm: PixelLink, has been implemented here: https://github.com/ZJULearning/pixel_link Contents: Introduc

484 Dec 07, 2022

Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)

SA-AutoAug Scale-aware Automatic Augmentation for Object Detection Yukang Chen, Yanwei Li, Tao Kong, Lu Qi, Ruihang Chu, Lei Li, Jiaya Jia [Paper] [Bi

182 Dec 29, 2022

End-to-end pipeline for real-time scene text detection and recognition.

Related tags

Overview

Real-time-Scene-Text-Detection-and-Recognition-System

Screenshot

Video

Owner

Fangneng Zhan

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

基于Paddle框架的PSENet复现

Computer vision applications project (Flask and OpenCV)

ARU-Net - Deep Learning Chinese Word Segment

Reference Code for AAAI-20 paper "Multi-Stage Self-Supervised Learning for Graph Convolutional Networks on Graphs with Few Labels"

Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"

Face_mosaic - Mosaic blur processing is applied to multiple faces appearing in the video

Document blur detection based on Laplacian operator and text detection.

Converts an image into funny, smaller amongus characters

Pytorch implementation of PSEnet with Pyramid Attention Network as feature extractor

A Python wrapper for the tesseract-ocr API

Um RPG de texto orientado a objetos.

Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition

Comparison-of-OCR (KerasOCR, PyTesseract,EasyOCR)

Camelot: PDF Table Extraction for Humans

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

Ocular is a state-of-the-art historical OCR system.

An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments

Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)