That’s Rotten
Predictive Models
Python
NLP
Can we use NLP to determine if the sentiment of a movie review is good or bad? Project uses data from Rotten Tomatoes to classify reviews using different classification models.
This project was part of a Data Science hackathon, participants had 7 hours to conduct EDA and build predictive models with a dataset of their choosing.
In this project, I focused on a dataset about Rotten Tomatoes movie reviews. The dataset was comprised of 10K+ text reviews and included classification labels (“Rotten”, “Fresh”).
My objective was to build a NLP classification model that could predict whether a review was “Rotten” and beat the baseline accuracy (50%).
![slides/slide-0.jpeg](slides/slide-0.jpeg)
![slides/slide-1.jpeg](slides/slide-1.jpeg)
![slides/slide-2.jpeg](slides/slide-2.jpeg)
![slides/slide-3.jpeg](slides/slide-3.jpeg)
![slides/slide-4.jpeg](slides/slide-4.jpeg)
![slides/slide-5.jpeg](slides/slide-5.jpeg)
![slides/slide-6.jpeg](slides/slide-6.jpeg)
![slides/slide-7.jpeg](slides/slide-7.jpeg)
![slides/slide-8.jpeg](slides/slide-8.jpeg)
![slides/slide-9.jpeg](slides/slide-9.jpeg)
![slides/slide-10.jpeg](slides/slide-10.jpeg)
![slides/slide-11.jpeg](slides/slide-11.jpeg)