An implementation of the algorithm in the paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

Last update: Dec 12, 2022

InceptText-Tensorflow

Introduction

This code targets TensorFlow 1.4.0.

Preparation

1. gcc 4.9

2. CUDA 8.0

3. cd lib && make
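
Before running make, it can help to confirm that the gcc on the PATH is the 4.9 series listed above. The following is a minimal Python 3 sketch, not part of the repository: the helper names parse_version, matches, and gcc_version are hypothetical, and it assumes gcc supports the -dumpversion option.

```python
import re
import shutil
import subprocess


def parse_version(text):
    """Return the first dotted version number in `text` as a tuple of ints."""
    match = re.search(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError("no version number found in %r" % text)
    return tuple(int(part) for part in match.group(0).split("."))


def matches(found, wanted):
    """True when `found` begins with every component of `wanted`."""
    return found[:len(wanted)] == wanted


def gcc_version():
    """Version of the gcc on PATH, or None when gcc is not installed."""
    if shutil.which("gcc") is None:
        return None
    output = subprocess.check_output(["gcc", "-dumpversion"]).decode()
    return parse_version(output)
```

Calling matches(gcc_version(), (4, 9)) after checking that gcc_version() is not None reports whether the installed compiler is a 4.9.x release.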

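Errors you may encounter:

Fix: add the CUDA path to the system environment variables, then change the include to #include <cuda.h>

Fix: find the absolute path of nsync_cv.h and include it by that path

Fix: find the absolute path of nsync_mu.h and include it by that path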
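
The fixes above depend on knowing where a header lives on disk. A minimal sketch for locating one by name follows; find_header is a hypothetical helper, not part of the repository, and the directories to search are an assumption that depends on the local install.

```python
import os


def find_header(name, roots):
    """Return the absolute path of every file called `name` under `roots`."""
    hits = []
    for root in roots:
        # os.walk silently skips roots that do not exist.
        for dirpath, _dirnames, filenames in os.walk(root):
            if name in filenames:
                hits.append(os.path.abspath(os.path.join(dirpath, name)))
    return sorted(hits)
```

A plausible search root for the nsync headers is the directory returned by tf.sysconfig.get_include(), for example find_header("nsync_cv.h", [tf.sysconfig.get_include()]). For cuda.h, the CUDA install directory is the natural root, though its location varies by system.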

Download

1. Models trained on ICDAR 2017

2. ResNet V1 50, provided by TensorFlow-Slim (ResNet-v1)

Train

python train_main.py

Test

python test.py

Owner

GeorgeJoe

Focus on NLP and OCR

