TextBoxes-TensorFlow

TextBoxes re-implementation using tensorflow. This project is greatly inspired by slim project And many functions are modified based on SSD-tensorflow project Later, we will overwrite this project so make it more flexiable and modularized.

Author: Daitao Xing : [email protected] Jin Huang : [email protected]

Progress

2017/ 03/14

data_processing phase finished Test：

1. Download the dataset， put 1/ folder and gt.mat uner ddata/sythtext/ folder（will wirte script）   
2. python datasets/data2record.py    
3. python image_processing.py

output： batch_size * 300 * 300 * 3 image

2017/ 03/17

Finish the design of training(can start training)

python train.py \
--train_dir=${TRAIN_DIR} \
--dataset_dir=${DATASET_DIR} \
--save_summaries_secs=60 \
--save_interval_secs=600 \
--weight_decay=0.0005 \
--optimizer=adam \
--learning_rate=0.001 \
--batch_size=32

Problems to be solved：

1. Need to redesign visualization		
2. image_processing can be improved

Next steps:

traing on other datasets
fine tunes
test
automatic downloading datasets and so on

TextBoxes re-implement using tensorflow

Related tags

Overview

TextBoxes-TensorFlow

Progress

Problems to be solved：

Next steps:

Owner

Gu Xiaodong

Train custom VR face tracking parameters

https://arxiv.org/abs/1904.01941

Detect the mathematical formula from the given picture and the same formula is extracted and converted into the latex code

A bot that extract text from images using the Tesseract OCR.

Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.

The official code for the ICCV-2021 paper "Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates".

Image augmentation for machine learning experiments.

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

Omdena-abuja-anpd - Automatic Number Plate Detection for the security of lives and properties using Computer Vision.

Motion detector, Full body detection, Upper body detection, Cat face detection, Smile detection, Face detection (haar cascade), Silverware detection, Face detection (lbp), and Sending email notifications

MXNet OCR implementation. Including text recognition and detection.

Usando o Amazon Textract como OCR para Extração de Dados no DynamoDB

LEARN OPENCV IN 3 HOURS USING PYTHON - INCLUDING EXAMPLE PROJECTS

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Repository for Scene Text Detection with Supervised Pyramid Context Network with tensorflow.

Awesome anomaly detection in medical images

BNF Globalization Code (CVPR 2016)

Sign Language Recognition service utilizing a deep learning model with Long Short-Term Memory to perform sign language recognition.

OpenCV-Erlang/Elixir bindings

EAST for ICPR MTWI 2018 Challenge II (Text detection of network images)