Sentiment Analysis Model - Binary Classification Model of Movie Reviews

This repository contains all data and code relating to the second assignment for CA4023, Natural Language Technologies. The instructions for this assignment are available here.

In the polarity_predictor.ipynb notebook, we have created a sentiment analysis model to classify movie reviews as either postive or negative. The original baseline system utilised a Bag-of-Words representation. We have designed a Bag-of-Bigrams and Bag-of-Trigrams implementation for the purpose of experimentation. There are various parameters associated with the model, which alter the structure of these features also, inlcuding:

Clipping counts
Performing negation
Removing stop words
Lemmatising words
Using additional features

There are more specific details given about these parameters in the notebook itself. In the notebook, we also experiment with various learning models and output the results of these to CSV files which are stored in the output folder. The learning models which we experimented with include:

Multinomial Naive Bayes
Logistic Regression
Decision Tree Classifier
SVM Classifier
Random Forest Classifier

We experiment with the baseline model with and without the features above, and also conduct a number of additional experiments with different combinations of features.

The baseline system which uses Naive Bayes achieved an average 10-fold cross-validation accuracy of 82.9%.
The best-performing model used a Logistic Regression learning model with default parameters, and achieved an average 10-fold cross-validation accuracy of 86.6%.

The experiments and changes to code are discussed in the polarity_predictor.ipynb notebook

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
.ipynb_checkpoints		.ipynb_checkpoints
data		data
output		output
.DS_Store		.DS_Store
CA4023_Assignment2.pdf		CA4023_Assignment2.pdf
README.md		README.md
polarity_predictor.ipynb		polarity_predictor.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Analysis Model - Binary Classification Model of Movie Reviews

About

Releases

Packages

Contributors 2

Languages

iftzp/binary-sentiment-analysis

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis Model - Binary Classification Model of Movie Reviews

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages