A Journey through Machine Learning

Text classification is achieved through the next Machine Learning techniques:

Additionally the datasets were pre-processed under the NLP guidelines, which covers:

The pre-processed files were vectorized in 3 different ways:

All previous models were applied in the following datasets:

Datasets can be found at:

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.DS_Store		.DS_Store
AML_CW2.py		AML_CW2.py
AWL_CW2.ipynb		AWL_CW2.ipynb
DeepNeuralModel.py		DeepNeuralModel.py
DeepNeuralModel_w2v.py		DeepNeuralModel_w2v.py
Movie Text Processing.ipynb		Movie Text Processing.ipynb
README.md		README.md
Toxic Text Processing (with multi-label classification).ipynb		Toxic Text Processing (with multi-label classification).ipynb
Tweet Text Processing.ipynb		Tweet Text Processing.ipynb
amzn_preprocessed.pickle		amzn_preprocessed.pickle
classifiers.py		classifiers.py
classifiersEdit.py		classifiersEdit.py
data.mat		data.mat
movie_dic.txt		movie_dic.txt
movie_preprocessed.pickle		movie_preprocessed.pickle

Provide feedback