This is a project to detect gestures to zoom in or out, using the real-time distance between the index finger and the thumb. It's based on OpenCV and Mediapipe.

Last update: Jul 11, 2022

Related tags

Overview

Pinch-zoom

This is a python project based on real-time hand-gesture detection, to zoom in or out, using the distance between the index finger and the thumb. It's based on OpenCV, Mediapipe and pyautogui. MediaPipe offers open source cross-platform, customizable ML solutions for live and streaming media, created by Google. Pyautogui is used to communicate with the keyboard and mouse, and control them by python.

To start using this, clone this repository and run

pip install -r requirement.txt

Then, run pinchZoom.py to start camera and use your index and thumb fingers to pinch zoom on an area.

Press any key to exit.

Owner

Harshit Bhalla

GitHub Repository

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

BRNet Introduction This is a release of the code of our paper Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds,

86 Oct 05, 2022

Scan the MRZ code of a passport and extract the firstname, lastname, passport number, nationality, date of birth, expiration date and personal numer.

PassportScanner Works with 2 and 3 line identity documents. What is this With PassportScanner you can use your camera to scan the MRZ code of a passpo

441 Dec 24, 2022

This is a project to detect gestures to zoom in or out, using the real-time distance between the index finger and the thumb. It's based on OpenCV and Mediapipe.

Related tags

Overview

Pinch-zoom

Owner

Harshit Bhalla

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

Scan the MRZ code of a passport and extract the firstname, lastname, passport number, nationality, date of birth, expiration date and personal numer.

Brief idea about our project is mentioned in project presentation file.

Using python libraries to track hands

Table Extraction Tool

Generic framework for historical document processing

Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)

A post-processing tool for scanned sheets of paper.

A curated list of promising OCR resources

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

A curated list of resources for text detection/recognition (optical character recognition ) with deep learning methods.

FastOCR is a desktop application for OCR API.

✌️Using this you can control your PC/Laptop volume by Hand Gestures created with Python.

Python-based tools for document analysis and OCR

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

A toolbox of scene text detection and recognition

Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining

Smart computer vision application

keras复现场景文本检测网络CPTN: 《Detecting Text in Natural Image with Connectionist Text Proposal Network》；欢迎试用，关注，并反馈问题...

Camelot: PDF Table Extraction for Humans