S.H.R.A.V.A.N.

Simplifying Complex Tasks Across Platforms for Everyone

Built at hackCBS 7.0

Created on 10th November 2024

•

S.H.R.A.V.A.N.

Simplifying Complex Tasks Across Platforms for Everyone

The problem S.H.R.A.V.A.N. solves

Shravan AI is a voice-driven automation platform designed to simplify complex tasks across domains, allowing users to control IoT devices, access data, manage routines, and handle essential tasks—all through natural language commands. This accessibility-focused tool streamlines daily routines, making life easier, safer, and more efficient, especially for:

Individuals with Disabilities: Shravan AI provides hands-free control over devices and services, enabling users with physical limitations to interact with technology effortlessly.

Non-Technical Users: By eliminating setup complexity, Shravan AI allows anyone to execute powerful automation routines across devices using simple commands.

IoT and DIY Enthusiasts: The platform centralizes and simplifies IoT project integration, supporting users to define and execute commands without managing complex inter-device protocols.

Professionals and Traders: With features for managing portfolios, receiving financial updates, and executing trades remotely, Shravan AI supports productivity and responsiveness in fast-moving markets.

Remote Computing Tasks: Users can execute resource-intensive tasks remotely, making Shravan AI a cost-effective solution for high-computation needs like ML training and data processing.

Healthcare: Shravan AI enables hospitals to manage patient alerts, request information, and issue voice commands for routine tasks, reducing manual inputs and improving patient care.

Data Privacy & Security: With built-in NLP, Shravan AI minimizes third-party dependency, enhancing data privacy and creating a secure ecosystem for personal automation.

Shravan AI’s intuitive voice commands save time, promote accessibility, and streamline daily life by seamlessly bridging the gap between users and technology.

Challenges we ran into

Developing Shravan AI involved navigating several technical and architectural hurdles:

NLP Processing for Diverse Commands: Building a robust NLP system to accurately understand a wide range of voice commands was challenging. We needed to account for variations in speech patterns, accents, and command contexts. By integrating models like Ollama and open-source LLMs, and refining through extensive testing and feedback, we achieved precise command interpretation.

Reducing Latency in Real-Time Execution: Minimizing latency in voice command processing was crucial for a smooth user experience, particularly for time-sensitive tasks. To address this, we optimized backend architecture, implemented WebSockets for persistent connections, and applied caching strategies to ensure rapid command responses.

Ensuring Security for OS-Level Control: Allowing voice-activated OS control required rigorous security protocols to prevent unauthorized access. We implemented multi-layered authentication, role-based access control, and secure socket connections to ensure only authorized users could execute sensitive commands.

Integrating IoT and Cross-Platform Communication: Bridging varied IoT devices with different communication protocols required a flexible API framework. Our solution was a modular architecture allowing each device type to communicate independently, while maintaining a unified command structure within Shravan AI.

Managing Remote Resource Sharing and High-Computing Tasks: Facilitating remote task execution without overloading local resources presented challenges. We developed a distributed command execution network, enabling high-computation tasks to be efficiently transferred to remote servers with minimal setup, ensuring consistent performance.

These challenges drove us to refine system architecture, enhance command interpretation, and strengthen security measures, resulting in a reliable, accessible, and powerful voice automation platform.

Tracks Applied (1)

Best Use of MongoDB Atlas

We used mongodb atlas for storing history objects for the gemini chat to implement embeddings for the chat being done by...Read More

Major League Hacking

Technologies used

WebSockets

Selenium

Python

Express.js

MongoDB

reactjs

gemini

Ollama

sarvam ai

React Native (Expo)

Discussion

Builders also viewed

See more projects on Devfolio