BlackBox

Our code was tested on Ubuntu 18.04 with CUDA 10.2, on a quad-core system with an NVIDIA GeForce RTX 2070 GPU (8 GB of GPU memory) and 48 GB of system RAM. It needs at least 20 GB of free disk space.
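Before starting, it can help to confirm that the working directory has enough free space. The following is a minimal sketch using only the Python standard library; the function names and the choice of decimal gigabytes (10**9 bytes) are assumptions made for illustration and are not part of the repository.

```python
import shutil

# Minimum free space stated in the requirements above.
REQUIRED_FREE_GB = 20


def free_disk_gb(path="."):
    """Return free space, in GB (10**9 bytes), on the filesystem holding `path`."""
    return shutil.disk_usage(path).free / 1e9


def has_enough_disk(path=".", required_gb=REQUIRED_FREE_GB):
    """Return True if at least `required_gb` GB are free at `path`."""
    return free_disk_gb(path) >= required_gb


print(f"Free space: {free_disk_gb():.1f} GB, enough: {has_enough_disk()}")
```

Run it from the directory where the data and models will be stored, since the check applies to the filesystem that holds the given path.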

Run all commands from the directory that contains BlackBox-production.yml, DATA_TRAINING.RData and TRUE_DATA_RANKING.RData.
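A quick way to verify that the current directory is the right one is to check for the three files named above. This is a small sketch; the helper name `missing_files` is hypothetical and not part of the repository.

```python
from pathlib import Path

# The three files named in the instructions above.
REQUIRED_FILES = (
    "BlackBox-production.yml",
    "DATA_TRAINING.RData",
    "TRUE_DATA_RANKING.RData",
)


def missing_files(directory=".", required=REQUIRED_FILES):
    """Return the required file names that are not present in `directory`."""
    base = Path(directory)
    return [name for name in required if not (base / name).is_file()]


missing = missing_files()
if missing:
    print("Missing files:", ", ".join(missing))
else:
    print("All required files found.")
```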

Install conda: https://docs.conda.io/projects/conda/en/latest/user-guide/install/#regular-installation

Create the conda environment with the required Python and R packages.

$ conda env create -f BlackBox-production.yml

$ conda activate BlackBox-production

Convert data and generate random training masks.

$ python convert_data.py

$ python generate_extra_masks.py  # takes up to 12 hours to complete

$ python shift_bitmaps.py

$ python downsample_bitmaps.py
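Because the second step can run for many hours, it may be convenient to launch the four data preparation scripts unattended. The sketch below runs them in the order listed above and stops at the first failure. It assumes it is started from inside the activated BlackBox-production environment, so that `sys.executable` is that environment's Python; the wrapper function `run_scripts` is hypothetical and not part of the repository.

```python
import subprocess
import sys

# Data preparation scripts, in the order given above.
DATA_PREP_SCRIPTS = [
    "convert_data.py",
    "generate_extra_masks.py",  # takes up to 12 hours to complete
    "shift_bitmaps.py",
    "downsample_bitmaps.py",
]


def run_scripts(scripts, run=subprocess.run):
    """Run each script with the current Python interpreter, in order.

    check=True makes subprocess.run raise CalledProcessError on a non-zero
    exit status, so later scripts are not started after a failure.
    Returns the list of scripts that completed.
    """
    completed = []
    for script in scripts:
        run([sys.executable, script], check=True)
        completed.append(script)
    return completed


# Usage (not executed here, since it starts the real pipeline):
# run_scripts(DATA_PREP_SCRIPTS)
```

The `run` parameter exists only so the wrapper can be exercised with a stand-in function before committing to a long run.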

Train models and evaluate their ensemble prediction (approximately 7 days to complete):

$ bash train_models.sh  # BASH script training an ensemble of 155 models, each with specific hyperparameters

$ python make_models_database.py -i Model

$ Rscript preprocesing_TRUE_DATA_RANKING.R  # creates file true.observations.rda with data used in score calculation

$ python make_prediction.py -i Model -o Model_score --number_of_predictions=20 --prediction_suffix=SampleA

$ python make_prediction.py -i Model -o Model_ensemble_score --only_score=20:SampleA --global_ensemble

The final prediction is saved in ./predictions/. The calculated score is written to the last line of ./Model_ensemble_score_20_SampleA.csv.
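To read the score without opening the file by hand, the last line of the CSV can be printed with a few lines of Python. This sketch makes no assumption about the column layout of the file and simply returns the raw last non-empty line; the helper name is hypothetical.

```python
from pathlib import Path


def last_nonempty_line(path):
    """Return the last non-empty line of a text file, or None if there is none."""
    lines = Path(path).read_text().splitlines()
    for line in reversed(lines):
        if line.strip():
            return line
    return None


score_file = Path("Model_ensemble_score_20_SampleA.csv")
if score_file.is_file():
    print(last_nonempty_line(score_file))
else:
    print(f"{score_file} not found; run the prediction steps first.")
```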

Comment on reproducibility

Our code uses the following Python libraries, which rely on different random number generators: random (from the standard library), numpy, pytorch, and fastai. Since pytorch, at least, does not guarantee reproducibility across platforms, CPU/GPU runs, or library versions even with a fixed random seed, we have opted not to seed any of the random number generators in our solution. As a result, some variation in the final result is expected between runs.
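The effect of seeding can be illustrated with the standard-library generator alone. The sketch below is only an illustration and is not code from this repository: a fixed seed repeats the same sequence, while an unseeded generator (the situation in our solution) draws from system entropy and differs from run to run. It does not cover numpy, pytorch, or fastai, which keep their own generator state.

```python
import random


def draw(seed=None, n=5):
    """Draw `n` floats from a fresh generator.

    With seed=None the generator is seeded from system entropy (or the
    current time), so results change between calls and between runs.
    """
    rng = random.Random(seed)
    return [rng.random() for _ in range(n)]


print("seeded:  ", draw(seed=0) == draw(seed=0))  # identical sequences
print("unseeded:", draw() == draw())              # sequences differ in practice
```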

