studentfolio

learn anything, master everything

400

Built at Warpspeed: Agentic AI Hackathon | Lightspeed India

Sarvam.ai: First Prize: Sarvam Track

Created on 22nd June 2025

•

studentfolio

learn anything, master everything

The problem studentfolio solves

Personalized mentorship at scale

People often need instant, contextual feedback whether studying, debugging, or learning something new. Without access to a live tutor or peer helper, progress stalls.

Bridging the “learning gap” in real time

When learning complex material (e.g. code, technical docs, math), confusion often strikes in the moment. Waiting for peer review or instructor replies disrupts momentum and motivation.

Challenges we ran into

1.Reliable Voice Capture & VAD

Mic input varies; distinguishing speech vs background noise and detecting when to start/stop listening is tricky.

Calibration across different hardware and acoustic environments adds complexity.

2.Accurate Transcription & Fallbacks

STT errors lead to misunderstandings; API latency adds delay.

You mitigate this by layering services (Sarvam), but integrating them and handling failures robustly is engineering-heavy.

3. Multimodal Processing with LLM

Combining text (speech transcript) and visual input (real time video streaming) for coherent answers is non-trivial. Synchronizing them into one prompt requires careful design.

Progress made before hackathon

Progress Made Before the Hackathon: Research
Before the hackathon, significant groundwork was laid through comprehensive research to deeply understand the fundamental problems in modern education and the feasibility of using generative AI for real-time, contextual mentorship. Key areas explored include:

Identifying the Core Educational Gap

Analyzed data from global education reports (OECD, UNESCO, etc.) to understand the mismatch between learning outcomes and industry needs.

Studied the limitations of traditional feedback mechanisms in classrooms and online platforms—highlighting the lack of instant support and personalized attention as major bottlenecks to learner progress.

Feasibility of AI-Powered Mentorship

Conducted a technical survey of large language models, speech-to-text, and vision-language systems capable of enabling real-time, multimodal interaction.

Evaluated TTS/STT tools (e.g., Whisper, Sarvam API), visual context tools (screenshot analysis), and prompt engineering strategies for contextual assistance in educational workflows.

User Personas & Learning Pain Points

Interviewed target user groups (students, self-learners, bootcamp participants) to map real-world friction points: delayed help, complex course materials, language barriers, and lack of mentorship.

Validated the need for context-aware AI assistance that integrates voice, visuals, and curriculum-based guidance.

Tracks Applied (2)

Sarvam AI Track

Assistive agents India is a country of 1.4+ billion people, with 22 official languages and hundreds of dialects,yet much...Read More

Sarvam.ai

Google Cloud Platform Usage

ur project is a voice-first, screenshot-aware AI assistant that delivers real-time feedback during learning, combining t...Read More

Google Cloud Platform

Technologies used

Next.js

Python

TypeScript

Groq Cloud

Amazon Bedrock

Sarvam-AI

Google-adk

google-gcp

google generativeai-sdk

Cheer Project

Cheering for a project means supporting a project you like with as little as 0.0025 ETH. Right now, you can Cheer using ETH on Arbitrum, Optimism and Base.

Discussion

Builders also viewed

See more projects on Devfolio