In this project, we use the live feed from the webcam to create a virtual mouse with full mouse functionality.

Last update: Dec 20, 2022

Overview

Virtual Mouse Using OpenCV

In this project, we use the live feed from the webcam to create a virtual mouse using hand tracking.

Project Description:

In this project, I use my hand as a virtual mouse that can do everything a mouse does without touching the system. The webcam detects my hand, and the program draws a bounding box around it and focuses on two fingers: the forefinger and the middle finger. The forefinger acts as the cursor, so moving it moves the cursor. To click, the program measures the distance between the forefinger and the middle finger; if the two are joined together, it performs a click.
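The cursor and click logic described above can be sketched in plain Python. This is a minimal sketch, not the project's actual code: the function names, the frame and screen sizes, and the 40-pixel click threshold are all assumptions chosen for illustration. In MediaPipe's hand model the index fingertip is landmark 8 and the middle fingertip is landmark 12; here their pixel coordinates are simply passed in as tuples.

```python
import math

# Hypothetical values for illustration; tune them for your camera and screen.
FRAME_W, FRAME_H = 640, 480      # webcam frame size in pixels
SCREEN_W, SCREEN_H = 1920, 1080  # screen size in pixels
CLICK_DISTANCE = 40              # fingertips closer than this count as "joined"


def to_screen(x, y):
    """Linearly map a point in the webcam frame to screen coordinates."""
    sx = x / FRAME_W * SCREEN_W
    sy = y / FRAME_H * SCREEN_H
    # Clamp so the cursor never leaves the screen.
    sx = min(max(sx, 0), SCREEN_W - 1)
    sy = min(max(sy, 0), SCREEN_H - 1)
    return sx, sy


def is_click(index_tip, middle_tip, threshold=CLICK_DISTANCE):
    """Return True when the two fingertips are close enough to count as a click."""
    distance = math.hypot(middle_tip[0] - index_tip[0],
                          middle_tip[1] - index_tip[1])
    return distance < threshold
```

In a full loop, the result of `to_screen` would typically be passed to `autopy.mouse.move(x, y)`, and `autopy.mouse.click()` would be called when `is_click` returns True.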

Furthermore, a smoothing factor was added because the raw cursor movement was very shaky.
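One common way to apply such a factor is to move the cursor only a fraction of the way toward the newly detected position on each frame. The sketch below assumes this approach; the function name `smooth` and the default factor of 7 are illustrative and not taken from the project.

```python
def smooth(prev, target, factor=7):
    """Move 1/factor of the way from the previous position to the target.

    A larger factor gives steadier but slower cursor movement;
    a factor of 1 disables smoothing entirely.
    """
    px, py = prev
    tx, ty = target
    return px + (tx - px) / factor, py + (ty - py) / factor


# Example: the cursor approaches a fixed target gradually over several frames.
position = (0.0, 0.0)
for _ in range(3):
    position = smooth(position, (70.0, 140.0))
```

Small jitters in the detected fingertip position are damped because each frame contributes only a fraction of the change.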

Requirements:

The following modules need to be installed for the project to work properly:

OpenCV
Mediapipe
Autopy

OpenCV:

OpenCV is a huge open-source library for computer vision, machine learning, and image processing. OpenCV supports a wide variety of programming languages like Python, C++, Java, etc. It can process images and videos to identify objects, faces, or even the handwriting of a human.

It can be installed using "pip install opencv-python"

Mediapipe:

MediaPipe is a framework for building multimodal (e.g. video, audio, any time series data), cross-platform (i.e. Android, iOS, web, edge devices) applied ML pipelines.

It can be installed using "pip install mediapipe"

Autopy:

AutoPy is a simple, cross-platform GUI automation library for Python. It includes functions for controlling the keyboard and mouse, finding colors and bitmaps on-screen, and displaying alerts.

It can be installed using "pip install autopy"
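After running the three install commands, a short check can confirm that each module is importable. This is a sketch; the helper name `missing_packages` is hypothetical. Note that the import name can differ from the pip package name: opencv-python is imported as `cv2`.

```python
import importlib.util

# Maps the pip package name to the module name used in "import ...".
REQUIRED = {"opencv-python": "cv2", "mediapipe": "mediapipe", "autopy": "autopy"}


def missing_packages(required=REQUIRED):
    """Return the pip names of required packages that cannot be imported."""
    return [pip_name for pip_name, module in required.items()
            if importlib.util.find_spec(module) is None]
```

Calling `print(missing_packages())` prints an empty list when everything is installed, or the names of the packages that still need a `pip install`.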

Important Note:

I faced a lot of dependency issues throughout this project. Some of the issues and their solutions are as follows:

autopy not installing: This is because autopy currently doesn't support Python versions above 3.8
webcam not opening: This was a bug in mediapipe and was fixed in the latest Python versions

Hence, in order for the project to run smoothly, you need to downgrade the Python version to 3.8

How to Downgrade the Python Version:

Follow these steps:

Uninstall Python from Add/Remove Programs.
Go to AppData and remove any Python folder you see.
Download Python 3.8 from this link: Python 3.8
Install it.
Open the command prompt and run "pip" to confirm the installation.
Your Python version has been downgraded :)
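To confirm from inside Python that the downgrade took effect, the running interpreter version can be compared against 3.8. This is a small sketch based on the note above that autopy does not support versions above 3.8; the function name `autopy_supported` is hypothetical.

```python
import sys


def autopy_supported(version=None):
    """Return True if the (major, minor) version is Python 3.8 or lower.

    With no argument, the version of the running interpreter is checked.
    """
    major, minor = (version or sys.version_info)[:2]
    return (major, minor) <= (3, 8)
```

Running `print(autopy_supported())` after reinstalling shows whether the interpreter on your PATH is the one you just installed.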

Contact Information:

For any further queries, feel free to contact me at:

Email: [email protected]

LinkedIn : Hassan Shahzad

Owner

Hassan Shahzad
