Gesture-Detection-and-Depth-Estimation

This is my graduation project.

(1) In this project, I use the YOLOv3 object detection model to detect gesture in RGB image. I trained the model on the self-made gesture dataset to obtain the gesture detection model based on deep learning. Then by testing the model on the test dataset, I found that the model can meet the requirements of real-time gesture detection while maintaining high accuracy.

(2) Then I tried to use the monocular depth estimation algorithm based on depth learning to estimate the depth of gesture object from a single RGB image, including FastDepth algorithm and the improved detection model based on YOLOv3. The FastDepth algorithm is trained and tested on the self-made gesture-depth dataset. Then, by adding a depth vector to output dimensions and modifying the loss function, the function of estimating target depth is added to the YOLOv3 model. Then I trained and tested the modified YOLOv3 model on the same gesture-depth dataset. Finally, the experiment results show that both methods can estimate the depth information of gesture object in RGB image to a certain extent.

Gesture detection:

Depth data:

Estimate target depth：

(3) Also, I developed a simple program with PyOpenGL that can use gesture information to draw simple shapes in three-dimensional space.

Try to draw a cube:

For more information, you can check my final paper.

YOLOv3 model is based on coldlarry's model: https://github.com/coldlarry/YOLOv3-complete-pruning

Graduation Project

Related tags

Overview

Gesture-Detection-and-Depth-Estimation

Owner

ChaosAT

Myia prototyping

Understanding Convolutional Neural Networks from Theoretical Perspective via Volterra Convolution

Cross-Modal Contrastive Learning for Text-to-Image Generation

Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)

This repository lets you interact with Lean through a REPL.

Cortex-compatible model server for Python and TensorFlow

The Empirical Investigation of Representation Learning for Imitation (EIRLI)

Manifold-Mixup implementation for fastai V2

Modifications of the official PyTorch implementation of StyleGAN3. Let's easily generate images and videos with StyleGAN2/2-ADA/3!

Equivariant layers for RC-complement symmetry in DNA sequence data

AbelNN: Deep Learning Python module from scratch

(ICCV 2021) Official code of "Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing."

Basit bir burç modülü.

Fast methods to work with hydro- and topography data in pure Python.

Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data

Official repository for ABC-GAN

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2020

《Train in Germany, Test in The USA: Making 3D Object Detectors Generalize》(CVPR 2020)

This library contains a Tensorflow implementation of the paper Stability Analysis of Unfolded WMMSE for Power Allocation