Nepali Image Captioning

This project is included in the AI-data science field. It performs the image captioning and presents the results in both visual and audible form in the native Nepali language.

243

Built at KU HackFest 2021

Created on 20th February 2021

•

Nepali Image Captioning

This project is included in the AI-data science field. It performs the image captioning and presents the results in both visual and audible form in the native Nepali language.

The problem Nepali Image Captioning solves

Problems: Everyone has an equal right to see the world, images but when it comes to the blind person, there are not many ways where they can also feel the image. Even for small children, learning from the images has been less practised. So to overcome this problem, we have come with a solution to image captioning.

Solution: It is used by the blind/disable and small kids to learn and understands what are present in the image, what are to be learned from it.
Not only this project can be helpful for various recommendation system and helps in recommending the new images based on the data collected from previous images.

Challenges we ran into

Firstly, the training of the model was very challenging for us, which in our local CPU would take more than 3 hours so we trained them in Kaggle notebooks hiring its GPU. Then, using the converted text into Nepali audio form, was quite hard which were solved using the version-controlled google translator. Similarly, on deploying the Heroku, static files weren't detected that were solved using separate Django Heroku libraries.

Technologies used

Django

Heroku

Google Translate API

Keras RNN

NLP

CSS3

HTML5

Discussion

Builders also viewed

See more projects on Devfolio