S

Sentiment-analysis-on-Disney-land-reviews

Performed EDA and sentiment analysis of reviews using metrics like Sentiment Polarity and VADER Polarity in NLP and created wordclouds for better visulaization.

73
S

Sentiment-analysis-on-Disney-land-reviews

Performed EDA and sentiment analysis of reviews using metrics like Sentiment Polarity and VADER Polarity in NLP and created wordclouds for better visulaization.


The problem Sentiment-analysis-on-Disney-land-reviews solves

The dataset includes 42656 reviews of 3 Disneyland branches - Paris, California and Hong Kong, posted by visitors on Trip Advisor.
Column Description:

Review_ID: unique id given to each review
Rating: ranging from 1 (unsatisfied) to 5 (satisfied)
Year_Month: when the reviewer visited the theme park
Reviewer_Location: country of origin of visitor
Review_Text: comments made by visitor
Disneyland_Branch: location of Disneyland Park

On this dataset I have performed EDA and sentiment analysis of reviews using metrics like Sentiment Polarity and VADER Polarity in NLP and created wordclouds for better visulaization.This processed data is then feeded to different classifier models to get trained and predict the sentiment of the test reviews.

Models used:
XGBoost - Extreme Gradient Boost alsorithm is based on the Gradient Boosting model which uses the boosting technique of ensemble learning where the underfitted data of the weak learners are passed on to the strong learners to increase the strength and accuracy of the model.
Decision Tree - This algorithm works on the basis of creating tree structures to take decisions
Random Forest - This algorithm works on the concept of emsemble learning.It used bagging technique to train multiple predictors on the same sampled instances to achieve a higher degree of accuracy.

XGBoost gave highest accuracy among all . Test accuracy : 83%

Challenges I ran into

Performing advance level EDA and drawing insights from it was little bit difficult. I refereed to various articles and kaggle notebooks and was able to understand how to draw conclusions.

Technologies used

Discussion