Yoga Pose Identification and Icon Matching

Project Goal

Detect yoga poses performed by a user and overlay a corresponding icon image. Running the main script starts the videostream with automatic pose detection.

Part 1: Pose Detection

I use the 32 body landmarks provided by MediaPipe to measure joint angles, then determine yoga poses based on key joint angles for each pose. For example, in the star pose, the angle between the shoulder, elbow, and wrist landmarks (elbow flexion) are below 20 degrees and the angle of the elbow, shoulder, and opposite shoulder (shoulder flexion) are also below 20 degrees.

Part 2: Icon Image Transformation

To transform the icon image that will be overlayed over the user, I first preprocess the icon image then apply an affine transform. To preprocess the icon, I resize the icon image to be roughly the same heigt as the user, a metric also calculated with MediaPie's landmarks. I then apply a border to the icon image so that its image array has the same dimensions as the video stream frames. These steps help make the affine transform more effective. I select three key pose landmarks for each pose, then find three key points on the icon that should match these points. For example, I chose to match the nose and ankles of the person with the top tip and bottom two tips of the star.

Part 3: Image Overlay

I overlayed just the icon pixels (the icon background is ignored) by summing .5 of the icon pixel value with .5 of the the video frame value, resulting in a transparent overlay of just the icon.

OpenCV, MediaPipe Pose Estimation, Affine Transform for Icon Overlay

Related tags

Overview

Yoga Pose Identification and Icon Matching

Project Goal

Part 1: Pose Detection

Part 2: Icon Image Transformation

Part 3: Image Overlay

Results

Star Pose

Tree Pose

Chair pose

Owner

Anna Garverick

Code to reproduce experiments in the paper "Explainability Requires Interactivity".

Texture mapping with variational auto-encoders

MaRS - a recursive filtering framework that allows for truly modular multi-sensor integration

TrackFormer: Multi-Object Tracking with Transformers

Python library to receive live stream events like comments and gifts in realtime from TikTok LIVE.

Label-Free Model Evaluation with Semi-Structured Dataset Representations

LoveDA: A Remote Sensing Land-Cover Dataset for Domain Adaptive Semantic Segmentation

Source code for 2021 ICCV paper "In-the-Wild Single Camera 3D Reconstruction Through Moving Water Surfaces"

Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

[CVPR 2022 Oral] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Some methods for comparing network representations in deep learning and neuroscience.

System Combination for Grammatical Error Correction Based on Integer Programming

Meta-meta-learning with evolution and plasticity

Get the partition that a file belongs and the percentage of space that consumes

Implementation of: "Exploring Randomly Wired Neural Networks for Image Recognition"

Iris prediction model is used to classify iris species created julia's DecisionTree, DataFrames, JLD2, PlotlyJS and Statistics packages.

[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning

An Intelligent Self-driving Truck System For Highway Transportation

Evaluating AlexNet features at various depths

Fast image augmentation library and an easy-to-use wrapper around other libraries