TextBoxes++-TensorFlow

A re-implementation of TextBoxes++ in TensorFlow. The project is heavily inspired by the slim project, and many functions are adapted from the SSD-tensorflow project.

Author: Zhisheng Zou [email protected]

pretrained model

Google drive

environment

Python 2.7 or Python 3.5

tensorflow-gpu 1.8.0

At least one GPU

how to use

Getting the xml files: create an XML annotation file like the example xml for each image and keep it together with the image, because the annotations must end up in the format shown in the standard xml
1. Picture format: *.png or *.PNG
Getting the xml and flags: make sure each XML file is in the same directory as its corresponding image, then run convert_xml_format.py
1. python tools/convert_xml_format.py -i in_dir -s split_flag -l save_logs -o output_dir
2. in_dir: the absolute path of the directory that contains the images and XML files
3. split_flag: whether to split the dataset
4. save_logs: whether to save train_xml.txt
5. output_dir: where to save the converted XML files
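Before running the conversion, it can help to confirm that every image really has an XML file next to it. The following is a minimal sketch and not part of the repository: the helper name `find_unpaired_images` is hypothetical, and it assumes that an XML file shares the base name of its image (for example `a.png` and `a.xml`), which the instructions above do not state explicitly.

```python
import os


def find_unpaired_images(in_dir):
    """Return the PNG file names in in_dir that have no same-named .xml file.

    Assumption (not confirmed by the project): image and XML share a base name.
    """
    names = os.listdir(in_dir)
    # Base names of all XML files found in the directory.
    xml_stems = {os.path.splitext(n)[0] for n in names if n.lower().endswith(".xml")}
    missing = []
    for name in sorted(names):
        stem, ext = os.path.splitext(name)
        # Only *.png and *.PNG are accepted picture formats.
        if ext in (".png", ".PNG") and stem not in xml_stems:
            missing.append(name)
    return missing
```

An empty list means every image in `in_dir` has a matching XML file under this naming assumption.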
Getting the tfrecords
1. python gene_tfrecords.py --xml_img_txt_path=./logs/train_xml.txt --output_dir=tfrecords
2. xml_img_txt_path: a list file like the train xml example
3. output_dir: where to save the tfrecords
Training
1. python train.py --train_dir=some_path --dataset_dir=some_path --checkpoint_path=some_path
2. train_dir: where checkpoints are stored during training
3. dataset_dir: where the tfrecords for training are stored
4. checkpoint_path: the model to be fine-tuned
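When the training run is launched from a script, each flag and its value should be passed as a single argument joined by `=` with no surrounding spaces. The sketch below only assembles the argument list from the three flags listed above; the helper name `build_train_command` is hypothetical and the paths are placeholders.

```python
import sys


def build_train_command(train_dir, dataset_dir, checkpoint_path):
    """Assemble the argument list for train.py (hypothetical helper)."""
    # Each "--flag=value" pair is one list element, so no stray space
    # can separate a flag from its value.
    return [
        sys.executable,
        "train.py",
        "--train_dir={}".format(train_dir),
        "--dataset_dir={}".format(dataset_dir),
        "--checkpoint_path={}".format(checkpoint_path),
    ]
```

The returned list can be handed to `subprocess.call` from the repository root, assuming `train.py` sits there.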
Testing
1. python test.py -m /home/model.ckpt-858 -o test
2. -m: the model checkpoint
3. -o: the output result directory
4. -i: the test image directory
5. -c: the device to run the test on
6. -n: the NMS threshold
7. -s: the score threshold
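To illustrate how a score threshold and an NMS threshold typically interact, here is a minimal sketch of score filtering followed by greedy non-maximum suppression. It is not taken from test.py: the function names are hypothetical, and it assumes axis-aligned boxes given as (xmin, ymin, xmax, ymax), whereas the repository's own implementation may differ, for example by working on quadrilateral text boxes.

```python
def iou(a, b):
    """Intersection over union of two (xmin, ymin, xmax, ymax) boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    # float() keeps the division exact on Python 2.7 as well.
    return float(inter) / union if union > 0 else 0.0


def filter_detections(boxes, scores, score_threshold, nms_threshold):
    """Return indices of boxes kept after score filtering and greedy NMS."""
    # Step 1: drop detections whose score is below the score threshold,
    # then visit the survivors from highest to lowest score.
    order = sorted(
        (i for i, s in enumerate(scores) if s >= score_threshold),
        key=lambda i: scores[i],
        reverse=True,
    )
    keep = []
    for i in order:
        # Step 2: keep a box only if it does not overlap an already kept,
        # higher-scoring box by more than the NMS threshold.
        if all(iou(boxes[i], boxes[j]) <= nms_threshold for j in keep):
            keep.append(i)
    return keep
```

In this sketch, raising the score threshold removes more low-confidence detections, while lowering the NMS threshold suppresses more overlapping boxes.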

Note:

While training the model, you can run eval_result.py to evaluate the model and save the results.
