This notebook implements, in Python 3, the ranking algorithm for the priority inbox from O'Reilly Media's Machine Learning for Email by Drew Conway and John Myles White. The dataset used in this implementation is the Apache SpamAssassin Public Corpus, which is included in this repository.
The modules used in this notebook are:
- numpy
- pandas
- nltk
- sklearn
- matplotlib
To install any of the modules above, run `pip install <module-name>`. Note that `sklearn` is the import name; the package is published on PyPI as `scikit-learn`.
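Before running the notebook, it can help to check which of the listed modules are already available. The following is a minimal sketch, not part of the original notebook: the `REQUIRED` mapping and the `missing_modules` helper are hypothetical names introduced here for illustration, and the check only looks for each module without importing it.

```python
import importlib.util

# Hypothetical mapping (not from the original notebook):
# import name -> name to pass to pip.
# Only sklearn differs: it is installed as scikit-learn.
REQUIRED = {
    "numpy": "numpy",
    "pandas": "pandas",
    "nltk": "nltk",
    "sklearn": "scikit-learn",
    "matplotlib": "matplotlib",
}


def missing_modules(required=REQUIRED):
    """Return the pip names of modules that cannot be found in this environment."""
    return [
        pip_name
        for import_name, pip_name in required.items()
        if importlib.util.find_spec(import_name) is None
    ]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules, install with: pip install " + " ".join(missing))
    else:
        print("All required modules are available.")
```

Running this in the first cell prints a single `pip install` command covering whatever is missing, so the dependencies can be installed in one step.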