Run a multitude of classifiers on your data and get an AUC report.


MRN-Code/polyssifier

 
 


Polyssifier

Polyssifier runs a multitude of machine learning classifiers on your data. It reports scores, confusion matrices, predictions, and a plot of the scores ranked by classifier performance.

Requirements

  • Python 3.3 or higher.
  • Keras
  • scikit-learn (sklearn)
  • NumPy
  • pandas

Optional

  • Nvidia GPU
  • CUDA

Features

  • Cross-validated results.
    • Report the F1 score (scoring='f1') or the ROC AUC (scoring='auc').
  • Parallel processing.
    • Control the number of threads with 'concurrency'.
    • We recommend setting concurrency to half the number of cores in your system.
  • Support for Nvidia GPUs (MLP only).
    • Set the Theano flag "device=gpu".
  • Saves trained models for future use in case of server malfunction.
    • Set project_name to identify an experiment.
  • Optional feature selection step.
    • Activate it by setting feature_selection=True.
  • Automatically scales your data with scale=True.
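
The options above can be gathered in one place before calling poly. The following is a minimal sketch, assuming that scoring, concurrency, project_name, feature_selection, and scale are all keyword arguments of poly (the feature list names them but does not show the call). The helper recommended_concurrency and the project name "my_experiment" are hypothetical and not part of polyssifier.

```python
import multiprocessing


def recommended_concurrency(n_cores=None):
    """Return half the number of cores, but never less than 1."""
    if n_cores is None:
        try:
            n_cores = multiprocessing.cpu_count()
        except NotImplementedError:
            # The core count cannot be determined on this platform.
            n_cores = 1
    return max(1, n_cores // 2)


# Assumed keyword arguments, named after the features listed above.
options = {
    "scoring": "auc",                 # or "f1"
    "concurrency": recommended_concurrency(),
    "project_name": "my_experiment",  # hypothetical experiment name
    "feature_selection": True,
    "scale": True,
}

# With polyssifier installed and data/label loaded, the call would look like:
# scores, confusions, predictions = poly(data, label, **options)
```

Keeping the options in a dictionary makes it easy to log them next to the saved models for a given project_name.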

Not compatible with Python 2 for the moment. We need a replacement for the "starmap" method of multiprocessing.Pool, which Python 2 does not provide.

Example: see sample/example.ipynb:

https://github.com/MRN-Code/polyssifier/blob/master/sample/example.ipynb

It includes the following classifiers:

  • Multilayer Perceptron (see mlp.py)
  • Nearest Neighbors
  • Linear SVM
  • RBF SVM
  • Decision Tree
  • Random Forest
  • Logistic Regression
  • Naive Bayes
  • Voting Classifier

You can exclude some of these classifiers by providing a list of their names, as follows:

from polyssifier import poly, plot

# data (features) and label (class labels) are assumed to be loaded beforehand.
scores, confusions, predictions = poly(data, label, n_folds=8, exclude=['Multilayer Perceptron'], verbose=1, concurrency=4)
plot(scores)
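
To run only a handful of classifiers, the exclude list can be derived from the names listed above. A minimal sketch, assuming exclude accepts the names exactly as they are written in the classifier list (the example above confirms this only for 'Multilayer Perceptron'); exclude_all_but is a hypothetical helper, not part of polyssifier.

```python
# Names copied from the classifier list above (assumed to match what exclude expects).
CLASSIFIER_NAMES = [
    "Multilayer Perceptron", "Nearest Neighbors", "Linear SVM", "RBF SVM",
    "Decision Tree", "Random Forest", "Logistic Regression", "Naive Bayes",
    "Voting Classifier",
]


def exclude_all_but(keep):
    """Return the exclude list that leaves only the classifiers in `keep`."""
    unknown = [name for name in keep if name not in CLASSIFIER_NAMES]
    if unknown:
        raise ValueError("Unknown classifier name(s): {}".format(unknown))
    return [name for name in CLASSIFIER_NAMES if name not in keep]


exclude = exclude_all_but(["Linear SVM", "Random Forest"])

# With polyssifier installed and data/label loaded, the call would look like:
# scores, confusions, predictions = poly(data, label, exclude=exclude)
```

Validating the names up front catches typos before a long cross-validation run starts.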
