Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network.

Last update: Dec 27, 2022

Overview

Dewarping Document Image

For details, please see the paper in 90_paper.pdf.

Dewarping Process

Two tasks in an FCN predict, at the pixel level, the displacement and the category (foreground or background) of each pixel. We then remove the background of the input image and map the foreground pixels to the rectified image by interpolation according to the predicted displacements. Cracks may emerge in the rectified image when forward-mapping interpolation is used. Therefore, we construct a Delaunay triangulation over all the scattered pixels and then interpolate.
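The crack-free rectification step described above can be sketched as follows. This is only an illustrative sketch, not the repository's implementation: the function name rectify_by_flow, the (dy, dx) ordering of the flow channels, and the choice of SciPy's LinearNDInterpolator (which builds a Delaunay triangulation of the scattered points and interpolates linearly inside each triangle) are all assumptions made for the example.

```python
import numpy as np
from scipy.interpolate import LinearNDInterpolator


def rectify_by_flow(image, flow, mask, fill_value=0.0):
    """Map foreground pixels to a rectified grid without cracks.

    Hypothetical helper (not from the repository). Assumed inputs:
      image: (H, W) or (H, W, C) array
      flow:  (H, W, 2) per-pixel displacement, ordered (dy, dx), in pixels
      mask:  (H, W) boolean array, True for foreground pixels
    """
    h, w = mask.shape
    ys, xs = np.nonzero(mask)  # background pixels are dropped here

    # Forward mapping: each foreground pixel lands on a scattered,
    # generally non-integer position in the rectified image.
    ty = ys + flow[ys, xs, 0]
    tx = xs + flow[ys, xs, 1]
    points = np.stack([ty, tx], axis=1)
    values = image[ys, xs].astype(np.float64)

    # Triangulate the scattered positions (Delaunay) and interpolate
    # linearly within each triangle, so gaps between mapped pixels are filled.
    interp = LinearNDInterpolator(points, values, fill_value=fill_value)

    # Evaluate on every integer pixel of the output grid.
    gy, gx = np.mgrid[0:h, 0:w]
    query = np.stack([gy.ravel(), gx.ravel()], axis=1)
    out = interp(query)
    return out.reshape((h, w) + image.shape[2:])
```

Output pixels that fall outside the triangulated region receive fill_value. Plain forward mapping would instead round each target position to the nearest pixel and leave unfilled gaps wherever the flow stretches the page.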

Compare

Notice

2020.11.10: Updated the result files, including 6-25_11_52_54-49-rgb_ and 6-25_11_52_54-49_.
2022.2.17: Updated the Release Code.
2022.4.14: Updated the Source file.

Release Code

The source code is open; please download it from Source.

Please send an email to [email protected].

Running

1. Download the model parameters and the source code.

2. Resize the input image to 1024x960: scale it up or down along its longest side while keeping the aspect ratio, then pad the remainder with zeros.

3. Run python test.py --data_path_test=./dataset/shrink_1024_960/crop/
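The resizing in step 2 could be done along the following lines. This is a minimal sketch under several assumptions: the helper name resize_and_pad is hypothetical, 1024 is taken to be the height and 960 the width (the README does not say which is which), the padding is placed at the bottom and right, and nearest-neighbour sampling is used only to keep the example dependency-free. An image library's resize function would normally give better quality.

```python
import numpy as np


def resize_and_pad(image, target_h=1024, target_w=960):
    """Scale an image to fit target_h x target_w, then zero-pad.

    Hypothetical helper (not from the repository). The height/width
    order and the bottom/right padding placement are assumptions.
    """
    h, w = image.shape[:2]

    # One scale factor for both axes keeps the aspect ratio; taking the
    # minimum guarantees the scaled image fits inside the target canvas.
    scale = min(target_h / h, target_w / w)
    new_h = max(1, min(target_h, int(round(h * scale))))
    new_w = max(1, min(target_w, int(round(w * scale))))

    # Nearest-neighbour resize: pick a source row/column for each
    # output row/column (sampling at output pixel centres).
    rows = np.minimum(((np.arange(new_h) + 0.5) / scale).astype(int), h - 1)
    cols = np.minimum(((np.arange(new_w) + 0.5) / scale).astype(int), w - 1)
    resized = image[rows][:, cols]

    # Zero-filled canvas; the resized image goes in the top-left corner.
    canvas = np.zeros((target_h, target_w) + image.shape[2:], dtype=image.dtype)
    canvas[:new_h, :new_w] = resized
    return canvas
```

The padded images would then be saved to the directory passed to test.py through --data_path_test.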

Training

Run python train.py

Dataset

The training dataset can be synthesised using the scripts.
