GuideDog is a 100% voice-controlled, AI/ML-based mobile app designed to assist people who are visually impaired

Last update: Nov 24, 2021

GuideDog

Authors: Kyuhee Jo, Steven Gunarso, Jacky Wang, Raghav Sharma

GuideDog is an AI/ML-based mobile app designed to assist people who are visually impaired, and it is 100% voice-controlled. As the name suggests, you can think of it as a "speaking guide dog." It offers three key features, all based on the scene captured by your phone's camera:

- Reads text on command
- Describes the scene around you on command
- Warns you if there is an obstacle in front of you

Check out this demo video to learn more about our app!

Android App

UI/UX
- Simple and responsive
- Voice-assistant architecture designed for the target audience

Libraries / APIs
- Google Cloud Speech-to-Text and Text-to-Speech
- Android SDK, AndroidX
- ML Kit Object Detection and Tracking API
- TensorFlow Lite MobileNet image classification model

Backend

Flask API
- Image captioning
- Optical character recognition (OCR)

Deployment
- Google App Engine
- Fast central API with different endpoints
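
As a rough illustration of what a central Flask API with one endpoint per task could look like, here is a minimal sketch. The route names (`/caption`, `/ocr`), the `image` form field, and the helper functions `caption_image` and `read_text` are hypothetical placeholders, not taken from the GuideDog code; the real service would call its captioning and OCR models where the placeholders return fixed strings.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)


def caption_image(image_bytes: bytes) -> str:
    # Hypothetical placeholder: a real backend would run the captioning model here.
    return "placeholder caption"


def read_text(image_bytes: bytes) -> str:
    # Hypothetical placeholder: a real backend would run OCR here.
    return "placeholder text"


def image_from_request():
    # Read the uploaded file from the (assumed) multipart field named "image".
    uploaded = request.files.get("image")
    return uploaded.read() if uploaded is not None else None


@app.route("/caption", methods=["POST"])
def caption():
    image = image_from_request()
    if image is None:
        return jsonify(error="missing 'image' file"), 400
    return jsonify(caption=caption_image(image))


@app.route("/ocr", methods=["POST"])
def ocr():
    image = image_from_request()
    if image is None:
        return jsonify(error="missing 'image' file"), 400
    return jsonify(text=read_text(image))
```

With this layout the mobile client would send each captured frame to whichever endpoint matches the spoken command, and both tasks share one deployment.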

Image Captioning

We used TensorFlow to build and train an image captioning model on MS-COCO 2014, based on the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. The model uses a standard convolutional network as an encoder to extract features from images (we use Inception V3) and feeds those features into an attention-based decoder that generates sentences. While the paper used an LSTM as the decoder, we use a simpler RNN instead.
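
To make the encoder/decoder split more concrete, the sketch below walks through a single decoding step with soft attention. It is written in NumPy rather than TensorFlow, uses random toy weights instead of trained ones, and assumes a plain tanh RNN cell as a stand-in for whatever recurrent cell the project actually uses. All names and sizes (`attention_step`, `rnn_step`, `W_f`, `W_h`, `v`, and so on) are illustrative assumptions.

```python
import numpy as np


def softmax(x):
    # Subtract the max before exponentiating for numerical stability.
    e = np.exp(x - np.max(x))
    return e / e.sum()


def attention_step(features, hidden, W_f, W_h, v):
    """Soft attention over image regions.

    features: (L, D) encoder output, one D-dim vector per image region.
    hidden:   (H,) current decoder state.
    Returns the context vector (D,) and the attention weights (L,).
    """
    # Additive score for each region, given the decoder state.
    scores = np.tanh(features @ W_f + hidden @ W_h) @ v  # (L,)
    weights = softmax(scores)                            # sums to 1
    context = weights @ features                         # weighted average, (D,)
    return context, weights


def rnn_step(context, word_embedding, hidden, W_x, W_hh, b):
    # Plain tanh RNN cell fed with the context vector and the previous word.
    x = np.concatenate([context, word_embedding])
    return np.tanh(x @ W_x + hidden @ W_hh + b)


# Toy sizes: L regions, D feature dim, H hidden dim, E embedding dim, A attention dim.
L, D, H, E, A = 6, 16, 8, 4, 5
rng = np.random.default_rng(0)

features = rng.normal(size=(L, D))   # stands in for CNN encoder features
hidden = np.zeros(H)                 # initial decoder state
word_embedding = rng.normal(size=E)  # stands in for the embedded start token

W_f = rng.normal(size=(D, A))
W_h = rng.normal(size=(H, A))
v = rng.normal(size=A)
W_x = rng.normal(size=(D + E, H))
W_hh = rng.normal(size=(H, H))
b = np.zeros(H)

context, weights = attention_step(features, hidden, W_f, W_h, v)
new_hidden = rnn_step(context, word_embedding, hidden, W_x, W_hh, b)
```

In a full decoder, `new_hidden` would be projected onto the vocabulary to pick the next word, and the step would repeat until an end token is produced.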

Get more insights: Devpost

Owner

Kyuhee Jo
