Created on 24th September 2022
•
The Yahoo! Answers topic classification dataset is constructed using the 10 largest main categories. Each class contains 140,000 training samples and 6,000 testing samples. Therefore, the total number of training samples is 1,400,000, and the testing samples are 60,000 in this dataset. From all the answers and other meta-information, we only used the best answer content and the primary category information.
The deep-learning model, I trained classifies the tweets into 10 different labels.
It's my first time training deep learning models and I never imagined I would be able to do it. Learning and doing was my only aim during building this project.