For Students:
For Companies:
Integrating a realistic and interactive 3D model interviewer using three.js while ensuring smooth performance and synchronization with audio files posed a significant challenge. We addressed this by using optimized open-source 3D models to maintain a seamless experience.
Generating accurate lipsync data and audio files dynamically using AWS Polly was challenging. To overcome this, we implemented a caching mechanism to store and reuse previously generated files, reducing latency and resource usage.
Ensuring that each user session is unique and secure, preventing multiple concurrent sessions, was difficult. We established a robust WebSocket connection with server-side checks to enforce single-session rules.
Implementing an automatic recording system that accurately detects pauses and stops recording appropriately was challenging. We utilized precise audio analysis algorithms to detect pauses exceeding 4 seconds and trigger automatic stop.
Achieving high accuracy in transcribing user responses using the Deepgram API required extensive testing and fine-tuning of the API settings to improve transcription accuracy.
Developing a comprehensive feedback system using OpenAI's API that evaluates responses on multiple criteria was challenging. We performed thorough prompt engineering to refine the evaluation prompts and ensure reliable and relevant feedback.
Ensuring the system can handle multiple users simultaneously without performance degradation was a significant challenge. We deployed the backend on AWS EC2 for scalability and used efficient database queries to handle high loads.
Technologies used
Discussion