"course_copilot"
A Multi Translating and Converting(Speech to Text and vice versa) Featured Web-App and Android Smartphone App called as course copilot.
Created on 1st October 2024
•
"course_copilot"
A Multi Translating and Converting(Speech to Text and vice versa) Featured Web-App and Android Smartphone App called as course copilot.
Describe your project
1.a. My application can translate the text given in English to any Indian language like Hindi, Telugu, Marathi etc.
b. My application can convert the text given in English to Speech or Audio .
c. My application can convert the speech or Audio in any language to the Text and then translate the text into any target
language, like Indian language like Hindi, Telugu, Marathi, Tamil, Kannada,Odiya, Punjabi, etc
d. My application has the provision or the feature to add images and animations from the device .
e. My application can convert the speech or audio in any language to the text in English language.
f. It can be deployed as a Chrome extension, webapp, or smartphone app.
- Actually as per the problem statement shared by Sarvam AI using Sarvam API's speech to text api or url which is
"https://www.sarvam.ai/apis/speech-to-text" this Application should directly covert the audio of speech in any language to the selected target langauage but My application converts any speech or audio into only text in english language.
3.There are future opportunities and possibilities for this solution we can use these Sarvam AI's API or Urls in many applications based on speech translation, text Translation and both, also wen use this application's idea or code in In App AI Integration, Whatsapp Integration and Voice Call integration. We can also use these in building personl language chatbots customized with multi lingual features and chatGPT kind of applications in our desired language.
Challenges I ran into
Some of the major problems or difficulties faced are converting the speech or audio recorded into different formats like audio.mp3, audio.wav and audio.mpeg etc. Another challenge is converting the recorded audio as file and then uploading the file. And converting the speech to text and translating again into multiple indian languages using multipart audio translation.
Tracks Applied (1)
14. Problem statement shared by Sarvam AI
Cheer Project
Cheering for a project means supporting a project you like with as little as 0.0025 ETH. Right now, you can Cheer using ETH on Arbitrum, Optimism and Base.
