As online lectures have increased so did recorded video content.We want to use speech translation and summarisation to create trnascripts,srts, and note of lecture to make it easier for students to go through these video.
Speech recognition model requires heavy computation and most api provide only 30s free conversion so if we used any powerful model for translation then text summarisation could also be implemented
Discussion