Skip to content

Binary Sentiment Analysis model for classification of movie reviews

Notifications You must be signed in to change notification settings

iftzp/binary-sentiment-analysis

Repository files navigation

Sentiment Analysis Model - Binary Classification Model of Movie Reviews

This repository contains all data and code relating to the second assignment for CA4023, Natural Language Technologies. The instructions for this assignment are available here.

In the polarity_predictor.ipynb notebook, we have created a sentiment analysis model to classify movie reviews as either postive or negative. The original baseline system utilised a Bag-of-Words representation. We have designed a Bag-of-Bigrams and Bag-of-Trigrams implementation for the purpose of experimentation. There are various parameters associated with the model, which alter the structure of these features also, inlcuding:

  • Clipping counts
  • Performing negation
  • Removing stop words
  • Lemmatising words
  • Using additional features

There are more specific details given about these parameters in the notebook itself. In the notebook, we also experiment with various learning models and output the results of these to CSV files which are stored in the output folder. The learning models which we experimented with include:

  • Multinomial Naive Bayes
  • Logistic Regression
  • Decision Tree Classifier
  • SVM Classifier
  • Random Forest Classifier

We experiment with the baseline model with and without the features above, and also conduct a number of additional experiments with different combinations of features.

The baseline system which uses Naive Bayes achieved an average 10-fold cross-validation accuracy of 82.9%.
The best-performing model used a Logistic Regression learning model with default parameters, and achieved an average 10-fold cross-validation accuracy of 86.6%.

The experiments and changes to code are discussed in the polarity_predictor.ipynb notebook

About

Binary Sentiment Analysis model for classification of movie reviews

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published