Skip to content
Visualaid

Visualaid

Visual Aid: Voice Assistant for the Visually Impaired

Created on 27th April 2025

Visualaid

Visualaid

Visual Aid: Voice Assistant for the Visually Impaired

The problem Visualaid solves

The Problem It Solves:
Visually impaired individuals struggle with everyday smartphone tasks like messaging, calling, reading emails, navigating, and controlling device settings. Current solutions are either complex or not integrated.
Our Visual Aid Voice Assistant solves this by providing a voice-controlled, hands-free solution for:

1)WhatsApp messaging, calls, and video calls
2)Email summarization
3)Live picture descriptions
4)System controls (volume, WiFi, Bluetooth,Brightness)
5)Real-time navigation and news summaries

Challenges we ran into

Challenges We Ran Into
1)Groq API Integration:
Challenge: Integrating Groq’s API with features like email summarization and live picture descriptions was initially difficult due to inconsistent documentation and API limitations.
Solution: We thoroughly reviewed the documentation and reached out to Groq support. After experimenting with different request formats and using robust error handling strategies, we were able to resolve API rate limits and timeouts.
2)Gmail Authentication:
Challenge: Setting up Gmail authentication with OAuth 2.0 for accessing unread emails was complicated by permission scopes and token refresh issues.
Solution: We implemented a proper flow using Google Auth Libraries and stored the credentials in token.json to persist authentication. Additionally, we ensured proper token expiry and refresh logic, which allowed smooth operation over time.
3)Voice Command Recognition:
Challenge: Achieving accurate voice command recognition in noisy environments was a significant hurdle.
Solution: We fine-tuned the microphone settings and implemented background noise reduction techniques. Additionally, we optimized the speech-to-text system and incorporated feedback mechanisms to improve the accuracy of command recognition in real-time.
4)Live Picture Descriptions:
Challenge: Processing live pictures in real-time for description was computationally expensive, especially on mobile devices.
Solution: We optimized the image processing pipeline by using OpenCV for image preprocessing. By reducing the resolution of images before sending them to the AI model for description, we achieved better speed and performance, ensuring smooth functionality even on lower-end devices.

Tracks Applied (1)

Groq track

How Our Project Fits into Groq: Groq Track Our project integrates Groq’s advanced AI models to create a voice assistant ...Read More
Groq

Groq

Discussion

Builders also viewed

See more projects on Devfolio