Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

Last update: Dec 30, 2022

Related tags

Computer Vision TableNet

Overview

TableNet

Unofficial implementation of ICDAR 2019 paper : TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images.

Paper

Overview

Paper: TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images

TableNet is a modern deep learning architecture that was proposed by a team from TCS Research year in the year 2019. The main motivation was to extract information from scanned tables through mobile phones or cameras.

They proposed a solution that includes accurate detection of the tabular region within an image and subsequently detecting and extracting information from the rows and columns of the detected table.

Architecture: The architecture is based out of Long et al., an encoder-decoder model for semantic segmentation. The same encoder/decoder network is used as the FCN architecture for table extraction. The images are preprocessed and modified using the Tesseract OCR.

Source: Nanonets

How to run

pip install -r requirements.txt

Download the Marmot Dataset from the link given in readme.
Run data_preprocess/generate_mask.py to generate Table and Column Mask of corresponding images.
Follow the TableNet.ipynb notebook to train and test the model.

Challenges

Require a very decent System with a good GPU for accurate result on High pixel images.

Dataset

Download the dataset provided in paper : Marmot Dataset.

Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

Related tags

Overview

TableNet

Overview

How to run

Challenges

Dataset

Owner

Jainam Shah

Memory tests solver with using OpenCV

An interactive interface for using OpenCV's GrabCut algorithm for image segmentation.

Code for the paper: Fusformer: A Transformer-based Fusion Approach for Hyperspectral Image Super-resolution

Document Image Dewarping

Augmenting Anchors by the Detector Itself

This repo contains a script that allows us to find range of colors in images using openCV, and then convert them into geo vectors.

kaldi-asr/kaldi is the official location of the Kaldi project.

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

Detect the mathematical formula from the given picture and the same formula is extracted and converted into the latex code

A simple python program to record security cam footage by detecting a face and body of a person in the frame.

Generate text images for training deep learning ocr model

一款基于Qt与OpenCV的仿真数字示波器

Morphological edge detection or object's boundary detection using erosion and dialation in OpenCV python

graph learning code for ogb

Balabobapy - Using artificial intelligence algorithms to continue the text

A simple OCR API server, seriously easy to be deployed by Docker, on Heroku as well

Computer vision applications project (Flask and OpenCV)

chineseocr/table_line 表格线检测模型pytorch版

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

Face Detection with DLIB