optimization-project

Project of the OPTIMIZATION course. The objective is to find a good initialization for the random decision trees, in order to speed up convergence.

The algorithm used to divide handle the clustering of the dataset and the initialization are all in the "scr/cluster.py" file.

src/ORCTModel.py it's just used in the notebooks, but it was not used to train the agent. Instead, the "sorct.py" file contains the correct model.

"source/" contains the utils used in sorct.py. This structure is inherited from the original code base.

run_tests.py loads all datasets, runs 4 different clustering algorithms, fit the HLR and then runs the optimization step. The results are put in the results/ folder (create it if not present). train_test.py loads a configuration of parameters and tests them on the folds of a train set. This folds are generated through KMeans. While the code works properly, the predictions are all random and it's not clear why.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

optimization-project

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
datasets		datasets
notebooks		notebooks
source		source
src		src
.gitignore		.gitignore
README.md		README.md
run_tests.py		run_tests.py
sorct.py		sorct.py
train_test.py		train_test.py

OscarPindaro/optimization-project

Folders and files

Latest commit

History

Repository files navigation

optimization-project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages