5 techniques
- BGR : Blue Green Red
- RGB : Red Green Blue
- Graye Level
- Adaptive Threshold Gray Level
- Gray Level Co occurrence Matrix (GLCM) features 'dissimilarity', 'contrast', 'homogeneity', 'energy', 'ASM', 'correlation'
Augmented by Gaussian Filter
30% data of each class is augmented and then saved.
30% increase in data (600 to 750+)
'dissimilarity', 'contrast', 'homogeneity', 'energy', 'ASM', 'correlation'
SVM, KNN, Logistic Regression, Random Forest, Naive Bayes
Learning curve
Validation Curve
Both use Stratified KFold cross validation by default
Accuracy, Precision, Recall, F1 Score, and AUC ROC score
KNN performs better most of the time. It gave a better accuracy of almost 64% with auc roc score of
0.90 as compare to other five models. Analyzed overfitting, underfitting, variance, and bias too.