A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Last update: Jan 17, 2022

Related tags

Overview

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

This is the repository for our Paper/Contribution to the WI2022 in Nürnberg.

Abstract

In recent years, large pre-trained deep neural networks (DNNs) have revolutionized the field of computer vision (CV). Although these DNNs have been shown to be very well suited for general image recognition tasks, application in industry is often precluded for three reasons:

large pre-trained DNNs are built on hundreds of millions of parameters, making deployment on many devices impossible,
the underlying dataset for pre-training consists of general objects, while industrial cases often consist of very specific objects, such as structures on solar wafers,
potentially biased pre-trained DNNs raise legal issues for companies.

As a remedy, we study neural networks for CV that we train from scratch. For this purpose, we use a real-world case from a solar wafer manufacturer. We find that our neural networks achieve similar performances as pre-trained DNNs, even though they consist of far fewer parameters and do not rely on third-party datasets.

Structure of this repository

+-- ImageClassification            | Runner Notebook + Scripts for experiments
+-- ReadMe.md			   | ReadMe
+-- Results.xlsx                   | Results that were reported in the paper
+-- RunResults                     | Detailed logging of our experiments results that were reported in the paper (IDs correspond to old IDs in the .xlsx file due to procedure)

You might also like...

Computer vision - fun segmentation experience using classic and deep tools :)

Computer_Vision_Segmentation_Fun Segmentation of Images and Video. Tools: pytorch Models: Classic model - GrabCut Deep model - Deeplabv3_resnet101 Flo

1 Dec 18, 2021

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision Project | Arxiv | Abstract It is very challenging for various visual tasks such as image

377 Jan 7, 2023

Unofficial PyTorch implementation of MobileViT based on paper "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".

MobileViT RegNet Unofficial PyTorch implementation of MobileViT based on paper MOBILEVIT: LIGHT-WEIGHT, GENERAL-PURPOSE, AND MOBILE-FRIENDLY VISION TR

91 Dec 2, 2022

Best Practices on Recommendation Systems

Recommenders What's New (February 4, 2021) We have a new relase Recommenders 2021.2! It comes with lots of bug fixes, optimizations and 3 new algorith

14.8k Jan 3, 2023

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets This is the official implementation of "Towards Good Pract

52 Nov 22, 2022

A DeepStack custom model for detecting common objects in dark/night images and videos.

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Related tags

Overview

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Abstract

Structure of this repository

You might also like...

Computer vision - fun segmentation experience using classic and deep tools :)

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision

Unofficial PyTorch implementation of MobileViT based on paper "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".

Best Practices on Recommendation Systems

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

A DeepStack custom model for detecting common objects in dark/night images and videos.

An unofficial styleguide and best practices summary for PyTorch

Seeing Dynamic Scene in the Dark: High-Quality Video Dataset with Mechatronic Alignment (ICCV2021)

Dark Finix: All in one hacking framework with almost 100 tools

Releases(v1.0)

v1.0(Jan 5, 2022)

Owner

Maximilian Harl

YOLOv5🚀 reproduction by Guo Quanhao using PaddlePaddle

Neighborhood Contrastive Learning for Novel Class Discovery

Out-of-Domain Human Mesh Reconstruction via Dynamic Bilevel Online Adaptation

GraPE is a Rust/Python library for high-performance Graph Processing and Embedding.

(ImageNet pretrained models) The official pytorch implemention of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"

[ICCV 2021 Oral] Mining Latent Classes for Few-shot Segmentation

🏆 The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)

AdaNet is a lightweight TensorFlow-based framework for automatically learning high-quality models with minimal expert intervention

a general-purpose Transformer based vision backbone

Background Matting: The World is Your Green Screen

an implementation of softmax splatting for differentiable forward warping using PyTorch

Implementation of SiameseXML (ICML 2021)

PyTorch implementation of PSPNet

Quantile Regression DQN a Minimal Working Example, Distributional Reinforcement Learning with Quantile Regression

Official PyTorch implementation of CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds

Music source separation is a task to separate audio recordings into individual sources

Learning to trade under the reinforcement learning framework

Library for fast text representation and classification.

[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing

Continuous Time LiDAR odometry