A little but useful tool to explore OCR data extracted with `pytesseract` and `opencv`

Last update: Dec 07, 2021

Overview

Screenshot OCR Tool

Extracting data from screen time screenshots in iOS and Android. We are exploring 3 options:

Simple OCR with no text position using pytesseract and OpenCV. We can then try and extract info with regex
Extract text and its position from each screenshot, classify data according to its position in the screenshot
Use YOLOv4 to extract some features from the screenshot and then use those features to train a ML model.
- https://arxiv.org/abs/2004.10934
- https://github.com/AlexeyAB/darknet

Instructions

So far there is not much to do really:

Add your screenshots in each folder
Run the script and wait for the tkinter window to show up
The panel on the right lets you explore the text extracted by pytesseract
Clicking on each top-level text in the tree view will highlight the text on the screenshot in red

Owner

Gabriele Marini

Passionate about technology, photography and travelling. I am currently a PhD student at the University of Melbourne

GitHub Repository

Table recognition inside douments using neural networks

TableTrainNet A simple project for training and testing table recognition in documents. This project was developed to make a neural network which reco

93 Jul 24, 2022

Face_mosaic - Mosaic blur processing is applied to multiple faces appearing in the video

動機 face_recognitionを使用して得られる顔座標は長方形であり、この座標をそのまま用いてぼかし処理を行った場合得られる画像は醜い。それに対してモ

6 Feb 03, 2022

Read Japanese manga inside browser with selectable text.

mokuro Read Japanese manga with selectable text inside a browser. See demo: https://kha-white.github.io/manga-demo mokuro_demo.mp4 Demo contains excer

170 Dec 27, 2022

SemTorch

SemTorch This repository contains different deep learning architectures definitions that can be applied to image segmentation. All the architectures a

154 Dec 07, 2022

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

Overview This collection demonstrates how to construct and train a deep, bidirectional stacked LSTM using CNN features as input with CTC loss to perfo

489 Dec 21, 2022

Line based ATR Engine based on OCRopy

OCR Engine based on OCRopy and Kraken using python3. It is designed to both be easy to use from the command line but also be modular to be integrated

948 Dec 23, 2022

Write-ups for the SwissHackingChallenge2021 CTF.

SwissHackingChallenge 2021 : Write-ups This repository contains a collection of my write-ups for challenges solved during the SwissHackingChallenge (S

3 Jun 07, 2021

The code for “Oriented RepPoints for Aerail Object Detection”

Oriented RepPoints for Aerial Object Detection The code for the implementation of “Oriented RepPoints”, Under review. (arXiv preprint) Introduction Or

207 Dec 24, 2022

Fatigue Driving Detection Based on Dlib

5 Dec 14, 2022

An advanced 2D image manipulation with features such as edge detection and image segmentation built using OpenCV

OpenCV-ToothPaint3-Advanced-Digital-Image-Editor This application named ‘Tooth Paint’ version TP_2020.3 (64-bit) or version 3 was developed within a w

1 Nov 05, 2021

Visual Attention based OCR

Attention-OCR Authours: Qi Guo and Yuntian Deng Visual Attention based OCR. The model first runs a sliding CNN on the image (images are resized to hei

1.1k Jan 02, 2023

Super Mario Game With Python

Super_Mario Hello all this is a simple python program which tries to use our body as a controller for the super mario game Here I have used media pipe

219 Nov 25, 2022

Fun program to overlay a mask to yourself using a webcam

Superhero Mask Overlay Description Simple project made for fun. It consists of placing a mask (a PNG image with transparent background) on your face.

10 Dec 01, 2022

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

SynthText Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Ved

1.8k Dec 28, 2022

With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want.

Virtual Keyboard With the virtual keyboard, you can write on the real time images by combining the thumb and index fingers on the letter you want. At

5 Jan 23, 2022