A three-stage detection and recognition pipeline of complex meters in wild

This is the first released system towards detection and recognition of complex meters in wild. The system can be divided into three moduels. Fisrtly, a yolo-based detector is applied to get pure meter region. Secondly, a spatial transformer module is eatablished to rectify the position of meter. Lastly, an end-to-end network is to read meter values, which is implemented by pointer/dail predcition and key number learning.

Visulization results

Left row is the original image, middle row is the process of meter rectification, right row is the result of meter value reading.

ToDo List

Installation

Requirements:

Python3 (Python3.7 is recommended)
PyTorch >= 1.0
torchvision from master
numpy
skimage
OpenCV==3.0.x
CUDA >= 9.0 (10.0 is recommended)

Models

Download Trained model

Please put distro_net.pt into meter_distro/weight.
put textgraph_vgg_450.pth into model/meter_data.

Demo

You can run a demo script for a single image inference by two steps.

python get_meter_area.py. and the detected meter will be stored in scene_image_data/deteced_meter

python predict.py to get distored meter and final result.

This is the first released system towards complex meters` detection and recognition, which is implemented by computer vision techniques.

Related tags

Overview

A three-stage detection and recognition pipeline of complex meters in wild

Visulization results

ToDo List

Installation

Requirements:

Models

Demo

Owner

Yan Shu

Official Code for "Non-deep Networks"

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

DEMix Layers for Modular Language Modeling

🌾 PASTIS 🌾 Panoptic Agricultural Satellite TIme Series

🔥 Cogitare - A Modern, Fast, and Modular Deep Learning and Machine Learning framework for Python

Code for the paper "Attention Approximates Sparse Distributed Memory"

[Pedestron] Generalizable Pedestrian Detection: The Elephant In The Room. @ CVPR2021

Identifying a Training-Set Attack’s Target Using Renormalized Influence Estimation

PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet

Simulation-based inference for the Galactic Center Excess

LightNet++: Boosted Light-weighted Networks for Real-time Semantic Segmentation

Continuous Time LiDAR odometry

Certified Patch Robustness via Smoothed Vision Transformers

Supplementary materials to "Spin-optomechanical quantum interface enabled by an ultrasmall mechanical and optical mode volume cavity" by H. Raniwala, S. Krastanov, M. Eichenfield, and D. R. Englund, 2022

Simulations for Turring patterns on an apically expanding domain. T

EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections

SIEM Logstash parsing for more than hundred technologies

Alpha-IoU: A Family of Power Intersection over Union Losses for Bounding Box Regression

Tackling data scarcity in Speech Translation using zero-shot multilingual Machine Translation techniques