Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices

Last update: Oct 10, 2022

Overview

Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices,
Linh Van Ma, Tin Trung Tran, Moongu Jeon, ICAIIC 2022 (The 4th International Conference on Artificial Intelligence in Information and Communication February 21 (Mon.) ~ 24 (Thur.), 2022, Guam, USA & Virtual Conference)

Gaze Estimation, Jetson Board Tx2, Realsense d435i Camera, Demo Video

How to run?

If you want to fine-tune this deep learning model, you first need to collect your own dataset. During collection, look at the center of each of the 36 rectangles.

python3 collect_dataset.py
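
To make the collection targets concrete, here is a minimal sketch of how the 36 rectangle centers could be computed. The 6 x 6 grid layout, the screen resolution, and the function name `rectangle_centers` are assumptions for illustration; the actual layout used by collect_dataset.py may differ.

```python
def rectangle_centers(screen_w, screen_h, rows=6, cols=6):
    """Return the (x, y) pixel centers of a rows x cols grid, row by row.

    Assumption: the rectangles tile the whole screen evenly. This is a
    hypothetical layout, not taken from collect_dataset.py.
    """
    cell_w = screen_w / cols
    cell_h = screen_h / rows
    return [
        ((c + 0.5) * cell_w, (r + 0.5) * cell_h)
        for r in range(rows)
        for c in range(cols)
    ]


if __name__ == "__main__":
    centers = rectangle_centers(1920, 1080)
    print(len(centers))  # 36
    print(centers[0])    # (160.0, 90.0)
```

Each center is a gaze target: the subject fixates it while the camera records, so every recorded frame can be paired with a known on-screen point.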

Once you finish collecting your dataset, change the subject folder in run_finetune.py to point to it. Then you can start fine-tuning the model.

python3 run_finetune.py

Remember to rebuild the TensorRT code the first time you run this source on your device. Change your working directory to ext/tensorrt_mtcnn before running the commands below.

chmod +x ./build.sh
./build.sh
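
If you prefer to trigger the build from Python (for example, from a setup script), the two commands above can be sketched as follows. The helper name `run_build` is hypothetical and not part of this repository; it only assumes that the directory passed in contains a build.sh.

```python
import stat
import subprocess
from pathlib import Path


def run_build(ext_dir):
    """Make build.sh executable and run it from inside ext_dir.

    Hypothetical helper: it mirrors `chmod +x ./build.sh` followed by
    `./build.sh`, executed with ext_dir as the working directory.
    """
    script = Path(ext_dir) / "build.sh"
    if not script.is_file():
        raise FileNotFoundError(f"build.sh not found in {ext_dir}")
    # Add the execute bits to whatever mode the file already has.
    mode = script.stat().st_mode
    script.chmod(mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)
    # check=True raises CalledProcessError if the build script fails.
    return subprocess.run(["./build.sh"], cwd=str(ext_dir), check=True)


if __name__ == "__main__":
    run_build("ext/tensorrt_mtcnn")
```

Passing `cwd` rather than calling `os.chdir` keeps the caller's working directory unchanged, so the scripts that follow can still be run from the repository root.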

You can now test gaze estimation. First connect a RealSense camera to the Jetson TX2, then run the following script.

python3 run_camera.py

To test with a recorded video, specify the location of your video in run_camera_test.py, then run the following script.

python3 run_camera_test.py

Dependencies

FAZE: Few-Shot Adaptive Gaze Estimation: https://github.com/NVlabs/few_shot_gaze
eos: https://github.com/patrikhuber/eos
HRNets: https://github.com/HRNet/HRNet-Facial-Landmark-Detection
mtcnn-pytorch: https://github.com/TropComplique/mtcnn-pytorch
Realtime-facial-landmark-detection: https://github.com/pathak-ashutosh/Realtime-facial-landmark-detection
MTCNN TensorRT(Demo #2: MTCNN): https://github.com/jkjung-avt/tensorrt_demos#mtcnn

5.1 TensorRT MTCNN Face Detector

5.2 Optimizing TensorRT MTCNN

Acknowledgement

A large part of the code is borrowed from FAZE: Few-Shot Adaptive Gaze Estimation and MTCNN TensorRT (Demo #2: MTCNN). Thanks to the authors for their wonderful work.


Owner

Linh
