Textboxes : Image Text Detection Model : python package (tensorflow)

Last update: Dec 15, 2022

Overview

shinTB

Abstract

A Python package for using Textboxes, an image text detection model.

Implemented with tensorflow and cv2 (OpenCV).

Textboxes Paper Review in Korean (My Blog) : shinjayne.github.io/textboxes

shintb : usable Textboxes python package (the source code is here)

svt1 : Street View Text dataset. Use it with shintb.svt_data_loader.SVTDataLoader when training a Textboxes model

config.py : (NECESSARY) configuration for building and training the model with shinTB

main.py : simple example usage of the shinTB package

Dependencies

python Version: 3.5.3
numpy Version: 1.13.0
tensorflow Version: 1.2.1
cv2
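
Because the versions above are fairly old, it can help to check what is installed before running anything. The following is a minimal sketch, not part of shinTB; the helper names installed_version and report_versions are made up for illustration, and the script only reports versions without enforcing them.

```python
import importlib
import sys

# Versions listed in this README; treated here as the tested configuration.
EXPECTED = {"numpy": "1.13.0", "tensorflow": "1.2.1"}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def report_versions():
    """Map each dependency name to a (found, expected) pair of version strings."""
    report = {"python": ("%d.%d.%d" % sys.version_info[:3], "3.5.3")}
    for name, expected in EXPECTED.items():
        report[name] = (installed_version(name), expected)
    # cv2 has no pinned version in this README, so only its presence is reported.
    report["cv2"] = (installed_version("cv2"), None)
    return report


for name, (found, expected) in sorted(report_versions().items()):
    print("%-10s found=%s expected=%s" % (name, found, expected))
```

A found value of None means the module could not be imported in the current environment.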

How to use

Clone this repository to your local machine.
You will use the shintb python package and config.py to build and train your own Textboxes model.
svt1 provides the training and test data.
Open a new python file.
Import config.config and the shintb modules.

from config import config
from shintb import graph_drawer, default_box_control, svt_data_loader, runner

Initialize GraphDrawer, DefaultBoxControl, and SVTDataLoader instances.

graphdrawer = graph_drawer.GraphDrawer(config)

dataloader = svt_data_loader.SVTDataLoader('./svt1/train.xml', './svt1/test.xml')

dbcontrol = default_box_control.DefaultBoxControl(config, graphdrawer)

The GraphDrawer instance contains the tensorflow graph of Textboxes.
The DefaultBoxControl instance contains the methods and attributes related to default boxes.
The SVTDataLoader instance loads data from svt1.
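
To give a feel for what loading svt1 involves, here is a rough sketch of reading word boxes from an annotation file with the standard library. It is not how SVTDataLoader is implemented. The XML layout (image, imageName, taggedRectangle with x, y, width, height attributes, and a tag child) is an assumption about the Street View Text annotation format, and the sample values are made up.

```python
import xml.etree.ElementTree as ET

# Tiny inline sample in the assumed SVT layout (values are made up).
SAMPLE_XML = """<tagset>
  <image>
    <imageName>img/example.jpg</imageName>
    <Resolution x="1280" y="880"/>
    <taggedRectangles>
      <taggedRectangle height="75" width="236" x="375" y="253">
        <tag>HOTEL</tag>
      </taggedRectangle>
      <taggedRectangle height="40" width="120" x="10" y="20">
        <tag>CAFE</tag>
      </taggedRectangle>
    </taggedRectangles>
  </image>
</tagset>"""


def parse_svt(xml_text):
    """Return a list of (image_name, [(x, y, width, height, word), ...])."""
    records = []
    for image in ET.fromstring(xml_text).iter("image"):
        name = image.findtext("imageName")
        boxes = []
        for rect in image.iter("taggedRectangle"):
            boxes.append((
                int(rect.get("x")),
                int(rect.get("y")),
                int(rect.get("width")),
                int(rect.get("height")),
                rect.findtext("tag"),
            ))
        records.append((name, boxes))
    return records


for name, boxes in parse_svt(SAMPLE_XML):
    print(name, boxes)
```

Check the actual files in svt1 before relying on these element names.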
Initialize a Runner instance.

runner = runner.Runner(config, graphdrawer, dataloader, dbcontrol)

Runner uses the GraphDrawer, DefaultBoxControl, and SVTDataLoader instances.
If you want to train your Textboxes model, use Runner.train(). Every 1000 steps, shintb saves a ckpt (checkpoint) file in the directory you set in config.py.

runner.train()

If you want to validate or test your model, use Runner.test().

runner.test()

After training, if you want to detect text in a single image, use Runner.image().

runner.image(<your_image_directory>)
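
The steps above can be collected into one function. This is only a sketch that reuses the calls shown in this README; the function name run_textboxes and its train_first parameter are made up for illustration, and it assumes the file sits in the repository root next to config.py, shintb, and svt1.

```python
def run_textboxes(image_path, train_first=True):
    """Build the shinTB objects in the order described above, then run detection."""
    # Imports live inside the function so this file loads even outside the repository.
    from config import config
    from shintb import graph_drawer, default_box_control, svt_data_loader, runner

    graphdrawer = graph_drawer.GraphDrawer(config)
    dataloader = svt_data_loader.SVTDataLoader('./svt1/train.xml', './svt1/test.xml')
    dbcontrol = default_box_control.DefaultBoxControl(config, graphdrawer)

    # Named tb_runner so the imported `runner` module is not shadowed.
    tb_runner = runner.Runner(config, graphdrawer, dataloader, dbcontrol)

    if train_first:
        tb_runner.train()  # saves ckpt files to the directory set in config.py
        tb_runner.test()

    # What Runner.image() returns is not documented here, so it is passed through.
    return tb_runner.image(image_path)
```

Calling run_textboxes with the path to one of your own images trains, tests, and then runs detection on that image.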

Owner

Jayne Shin (신재인)
