A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Last update: Jan 17, 2022

This repository accompanies our paper, a contribution to WI2022 in Nürnberg.

Abstract

In recent years, large pre-trained deep neural networks (DNNs) have revolutionized the field of computer vision (CV). Although these DNNs have been shown to be very well suited for general image recognition tasks, application in industry is often precluded for three reasons:

1. large pre-trained DNNs are built on hundreds of millions of parameters, making deployment on many devices impossible,
2. the underlying dataset for pre-training consists of general objects, while industrial cases often consist of very specific objects, such as structures on solar wafers,
3. potentially biased pre-trained DNNs raise legal issues for companies.

As a remedy, we study neural networks for CV that we train from scratch. For this purpose, we use a real-world case from a solar wafer manufacturer. We find that our neural networks achieve performance similar to that of pre-trained DNNs, even though they consist of far fewer parameters and do not rely on third-party datasets.
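To make the parameter argument concrete, the following minimal sketch counts the trainable parameters of a small convolutional network using the standard per-layer formulas. The layer sizes, the single input channel (grayscale images), and the two output classes are assumptions chosen for illustration only; this is not the architecture used in the paper.

```python
# Hypothetical example: the layer sizes below are illustrative assumptions,
# NOT the networks evaluated in the paper.

def conv2d_params(in_channels, out_channels, kernel_size, bias=True):
    """Trainable parameters of a 2D convolution with a square kernel."""
    weights = in_channels * out_channels * kernel_size * kernel_size
    return weights + (out_channels if bias else 0)


def linear_params(in_features, out_features, bias=True):
    """Trainable parameters of a fully connected layer."""
    return in_features * out_features + (out_features if bias else 0)


def count_small_cnn_params(num_classes=2):
    """Three 3x3 conv layers, global average pooling, then a classifier.

    Pooling and activation layers add no trainable parameters.
    """
    layers = [
        conv2d_params(1, 16, 3),         # assumes grayscale input
        conv2d_params(16, 32, 3),
        conv2d_params(32, 64, 3),
        linear_params(64, num_classes),  # assumes a two-class task
    ]
    return sum(layers)


if __name__ == "__main__":
    small = count_small_cnn_params()
    # Order of magnitude named in the abstract for large pre-trained DNNs.
    large = 100_000_000
    print(f"small CNN parameters: {small:,}")
    print(f"size ratio (large / small): {large / small:,.0f}x")
```

Under these assumptions the small network has 23,426 parameters, several thousand times fewer than a model with hundreds of millions of parameters, which illustrates why deployment on constrained devices becomes feasible.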

Structure of this repository

+-- ImageClassification            | Runner notebook + scripts for experiments
+-- ReadMe.md                      | ReadMe
+-- Results.xlsx                   | Results reported in the paper
+-- RunResults                     | Detailed logs of the experiment results reported in the paper (owing to the procedure, IDs correspond to the old IDs in the .xlsx file)

Releases (v1.0)

v1.0 (Jan 5, 2022)

Owner

Maximilian Harl
