In-Domain Cross-Writer Sentiment Classification on Movie Reviews

Research Context

With the rise of the information age, sentiment analysis on texts has been a crucial Natural Language Processing task that many researchers have put effort on. Recent researches on sentiment analysis have obtained great results using various methods such as Naive Bayes, SVM, Recurrent Neural Networks and etc.

More recently, cross-domain sentiment classification that applies the model trained on texts in one specific domain, for instance, reviews to the texts in another domain, for instance, tweets has been a topic of interest because datasets without standard labels can be classified using a pretrained model, where normal sentiment classification models give significantly worse results. Popular methods for cross-doamin sentiment classification include Sentiment Sensitive Thesaurus, Stacked Denoising Auto-Encoders, Spectral Feature Alignment and etc.

Research Objective:

We are going to investigate how current models and our model perform on the In-Domain Cross-Writer sentiment classification task on movie review dataset by Pang and Lee \cite{Pang+Lee:05a}.

First we need to show that models that excel at normal sentiment classification tasks will end up with worse performance on this task. Then we start by presenting an empirical study of current methods of normal sentiment classification tasks and hopefully propose a new method that specifically addresses the this task.

If our method turns out that no significantly improved performance on experiments is observed, then we will need to account for the failure and gain insights on why some current methods outperform the others.

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
present		present
scale_data		scale_data
scale_whole_review		scale_whole_review
writing		writing
.gitattributes		.gitattributes
.gitignore		.gitignore
LSTM.ipynb		LSTM.ipynb
Method_organize.ipynb		Method_organize.ipynb
Model1.py		Model1.py
NBSVM.ipynb		NBSVM.ipynb
README.md		README.md
Untitled.ipynb		Untitled.ipynb
comparison_res.csv		comparison_res.csv
cross_valid_manually12.py		cross_valid_manually12.py
function1.py		function1.py
glove_dl23.py		glove_dl23.py
glove_empirical.ipynb		glove_empirical.ipynb
glove_nn.ipynb		glove_nn.ipynb
glove_nn_bugfree.ipynb		glove_nn_bugfree.ipynb
implement_grocery.py		implement_grocery.py
in-domain.ipynb		in-domain.ipynb
model.h5		model.h5
model.json		model.json
model_ft_3l.h5		model_ft_3l.h5
model_ft_3l.json		model_ft_3l.json
model_gl.h5		model_gl.h5
model_gl.json		model_gl.json
model_gl_cnn.h5		model_gl_cnn.h5
model_gl_cnn.json		model_gl_cnn.json
movie_reviews_xuwen.ipynb		movie_reviews_xuwen.ipynb
pca2_xuwen.ipynb		pca2_xuwen.ipynb
plot.ipynb		plot.ipynb
pre_organize_xuwen.py		pre_organize_xuwen.py
preprocess_xuwen.ipynb		preprocess_xuwen.ipynb
pretrain_ke.py		pretrain_ke.py
test.csv		test.csv
toxic_xuwen.ipynb		toxic_xuwen.ipynb
train.csv		train.csv
word_embed.py		word_embed.py
word_embed_f.py		word_embed_f.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

In-Domain Cross-Writer Sentiment Classification on Movie Reviews

Research Context

Research Objective:

Data:

movie reviews on Rotten Tomato

Word Vector

Two principal components

Model

About

Releases

Packages

Contributors 3

Languages

CornellDataScience/SWiMR

Folders and files

Latest commit

History

Repository files navigation

In-Domain Cross-Writer Sentiment Classification on Movie Reviews

Research Context

Research Objective:

Data:

movie reviews on Rotten Tomato

Word Vector

Two principal components

Model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages