Imagine the frustration of not being able to communicate effectively with strangers, leading to social isolation and misunderstandings. Consider the barriers to accessing information or services due to communication limitations, which can result in discrimination and even safety risks.
Think about the emotional distress and health-care issues that arise from these challenges, such as misdiagnosis and mental health struggles due to a lack of clear communication. These struggles are not just inconveniences but significant obstacles that affect the well-being and quality of life of mute communities.
Understanding these profound challenges faced by mute individuals serves as the driving force behind the development of this AI/ML web app. The inspiration lies in the desire to alleviate these struggles and empower mute individuals to communicate effectively, access information and services, and ultimately lead more connected and fulfilling lives.
Solution:
Our web application employs advanced technologies such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Graph Neural Networks (GNNs), and Large Language Models (LLMs) for real-time gesture recognition, translating sign language gestures into text or speech. Integrated with Socket.io for seamless communication, React for the frontend, and Flask for the backend, our platform facilitates instant interaction between mute individuals and others. Additionally, utilizing Google Translator and Google Text-to-Speech (gtts), we ensure accurate translation and natural-sounding speech synthesis, empowering mute individuals to overcome communication barriers, access information, and foster inclusivity in real-time interactions.
-> Difficulty Generating Frames Continuously: Our initial struggle revolved around the continuous generation of frames from the frontend and accessing the camera. This posed a critical obstacle as it directly impacted our ability to capture real-time sign language gestures accurately. Without a consistent stream of frames, our application's functionality was compromised, hindering the user experience and overall effectiveness of the platform.
-> Integration Issues without Socket.io: Trying to implement our system without leveraging Socket.io led to numerous integration issues. Notably, we encountered slow frame transmission and significant delays in responses. These delays not only affected the responsiveness of our application but also led to automatic cessation of frame reception after a certain period. This automatic stoppage further exacerbated the problem, creating a disjointed and frustrating user experience.
-> Integration Challenge of Socket.io in Flask Backend: Integrating Socket.io into our Flask backend presented a significant technical challenge. Establishing seamless communication between the frontend and backend is crucial for real-time interaction, which is fundamental to our application's functionality. However, overcoming the intricacies of integrating Socket.io within the Flask framework required careful attention to detail and thorough troubleshooting to ensure stable and reliable communication channels.
-> Frontend and Backend Integration: The integration of frontend and backend components proved to be another significant challenge. Smooth communication and data exchange between these two parts of the application are essential for delivering a seamless user experience. Achieving this required meticulous coordination and troubleshooting to address compatibility issues and ensure that data was transmitted accurately and efficiently between the frontend and backend systems.
Tracks Applied (4)
Discussion