To change the LSTM into using numbers and not the discretization #184

AlyaGomaa · 2023-02-10T13:16:47Z

Created by Alya Gomaa via monday.com integration. 🎉

tahifahimi · 2024-02-28T20:34:30Z

In the following, we present the performance evaluation of four different machine learning models trained on a dataset (”modules/rnn_cc_detection/datasets/dataset_more_labels.dat”). The models evaluated include Random Forest, Support Vector Machine (SVM), k-Nearest Neighbors (KNN), and Recurrent Neural Network (RNN). The dataset comprises features labelled with binary classes.
#316 is proposed using one-hot encoding, but in the following, we used the StratoLetter mapping to integers.

Model Overview:

Random Forest:
- Accuracy: 1
- Methodology: The Random Forest classifier achieved an accuracy of 100% and an F1 score of 1. It was trained using 100 decision trees.
Support Vector Machine (SVM):
- Accuracy: 0.8461
- F1 Score: 0.9166
- Methodology: The SVM model, utilizing a radial basis function kernel, attained an accuracy of 84% and an F1 score of 0.91. The features were scaled using StandardScaler.
k-Nearest Neighbors (KNN):
- Accuracy: 0.7692
- F1 Score: 0.8695
- Methodology: The KNN classifier with 5 neighbors achieved an accuracy of 76% and an F1 score of 0.86. The features were scaled using StandardScaler.
Recurrent Neural Network (RNN):
- Accuracy: 0.8461
- Loss: 0.6770
- Methodology: The RNN model, a Bidirectional GRU with dropout layers, achieved an accuracy of 84% on the test dataset. It was trained for 10 epochs using RMSprop optimizer.

Discussion:

The Random Forest model demonstrated the highest accuracy among the traditional machine learning models evaluated, achieving 100% accuracy.
The dataset has 62 records. It is expected that by increasing the number of records, the model's accuracy will increase.
All models were trained and tested on the same dataset split, ensuring fair comparison of their performance metrics.

Details are available at 7fbc2ce

AlyaGomaa added the Enhancement label Feb 14, 2023

eldraco added the Machine Learning Needs knowledge of Machine Learning label Feb 24, 2023

prakharguptaujjain mentioned this issue Apr 20, 2023

Added GRU_using_numbers #316

Open

tahifahimi linked a pull request Feb 6, 2024 that will close this issue

Add Hot Encoding to the RNN model #447

Open

7 tasks

AlyaGomaa added this to Slips Jul 12, 2024

github-project-automation bot moved this to Todo in Slips Jul 12, 2024

eldraco added this to the Fix the ML models milestone Jul 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

To change the LSTM into using numbers and not the discretization #184

To change the LSTM into using numbers and not the discretization #184

AlyaGomaa commented Feb 10, 2023

tahifahimi commented Feb 28, 2024 •

edited

Loading

To change the LSTM into using numbers and not the discretization #184

To change the LSTM into using numbers and not the discretization #184

Comments

AlyaGomaa commented Feb 10, 2023

tahifahimi commented Feb 28, 2024 • edited Loading

tahifahimi commented Feb 28, 2024 •

edited

Loading