TeachinAssistant

AI teaching assistant chatbot understands voice queries in 50+ languages via Whisper-1. It processes input through a LLM, considering chat history and topic knowledge using RAG pipeline.

Built at Aurora2024

Created on 6th October 2024

•

TeachinAssistant

AI teaching assistant chatbot understands voice queries in 50+ languages via Whisper-1. It processes input through a LLM, considering chat history and topic knowledge using RAG pipeline.

The problem TeachinAssistant solves

This project features an interactive AI-powered teaching assistant chatbot capable of understanding voice input of student queries in over 50 languages, using Whisper-1 model, which also removes background noise for clearer communication. This input is then processed by a LLM tailored to match specific Teaching assistant personalities, considering chat history and Topic knowledge stored in a vector database via Retrieval Augmented Generation (RAG). The result is a text response that ensures engaging interactions with Students.

Challenges I ran into

Issue: The combination of real-time voice transcription using Whisper-1, language model processing, and retrieval from the vector database introduced delays that pushed the response time beyond acceptable limits. Our goal was to reduce latency to under 500ms to ensure smooth, real-time interaction.
Solution: In the Retrieval Augmented Generation (RAG) pipeline, the retrieval step was one of the bottlenecks. To mitigate this, we optimized the vector database by implementing indexed search with asynchronous pre-fetching. This allowed the system to fetch relevant knowledge in parallel with the LLM processing, thereby reducing wait time.

Technologies used

Gradio

Pinecone

GPT-4o

Whisper-1

OpenAI-SDK

Discussion

Builders also viewed

See more projects on Devfolio