Add comp #57

enryH · 2024-02-20T12:11:43Z

Streamline comparison and add new ways of sampling simulated missing values.

fixed that DAE and VAE can also be trained without a validation split
cleaned up model classes, started to get ride of PeptideAnalyzer (also moving code)
no dependency on Thermo specific metadata (basic quality control by number of features in sample)

- start to get rid of AnalyzePeptides class

- base filtering on number of quantified samples (sample completness)

- 🔥 remove old funtions which are obsolete - 🎨 rename fake_na to simulated_na (to do everywhere) - clean-up fastai imports and assign parts explicitly for linting

- 🔥 remove one subclassing level, commented code - 🎨 rename fake to simulated NA and clean-up notebook

- if there will be a command line interface, then it will be defined in __main__.py

- 🎨 remove one level of inheritence - 🎨 rename fake to simulated NA

Idea: Only use a subset of samples to generate simulated validation and test data from. Performance is then split by validation and test samples, although the remaining data can be used at training time. (will be compared to setup where simulated missing values are sampled from all samples)

- fastai routine needs always a validation dataset, but it can be empty.

- set fastai defaults to cpu

- compare two sampling strategies

enryH · 2024-03-05T18:26:16Z

fixed that DAE and VAE can also be trained without a validation split
cleaned up model classes, started to get ride of PeptideAnalyzer (also moving code)
no dependency on Thermo specific metadata (basic quality control by number of features in sample)

Henry added 7 commits February 19, 2024 18:04

🎨➕ use njab, depreceate AnalyzePeptides for data loading

c6928ef

- start to get rid of AnalyzePeptides class

🎨 remove RT_min parameter (replace by sample_completeness)

89dd625

- base filtering on number of quantified samples (sample completness)

🎨🔥 clean up collaberative filtering code

8948ef0

- 🔥 remove old funtions which are obsolete - 🎨 rename fake_na to simulated_na (to do everywhere) - clean-up fastai imports and assign parts explicitly for linting

🎨 DAE: simplify ModelAdapter

1bdc68a

- 🔥 remove one subclassing level, commented code - 🎨 rename fake to simulated NA and clean-up notebook

🔥 remove cmd code

37e4e1c

- if there will be a command line interface, then it will be defined in __main__.py

🎨 Combine VAE ModelAdapter CallBack

4dac5a7

- 🎨 remove one level of inheritence - 🎨 rename fake to simulated NA

enryH force-pushed the add_comp branch 2 times, most recently from ebb42d1 to 651f6bc Compare February 23, 2024 13:11

✨ make DAE and VAE train without a validation dataset

a4f36da

- fastai routine needs always a validation dataset, but it can be empty.

enryH force-pushed the add_comp branch from 651f6bc to a4f36da Compare February 23, 2024 13:27

Henry added 5 commits February 24, 2024 09:45

🐛 make CF Sklearn model run without MPS

bae51d7

- set fastai defaults to cpu

🎨 small improvements in tutorial and explicit import for CF Transformer

e5d48e8

✨ rev. 3 tables

2c83fd3

- compare two sampling strategies

✨ Add option to dump agg. pred across models

2476a58

📝 improve installation instructions

92b7fa7

enryH marked this pull request as ready for review March 5, 2024 18:19

enryH merged commit 86aa007 into dev Mar 5, 2024
7 checks passed

enryH deleted the add_comp branch March 5, 2024 18:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add comp #57

Add comp #57

enryH commented Feb 20, 2024 •

edited

Loading

enryH commented Mar 5, 2024

Add comp #57

Add comp #57

Conversation

enryH commented Feb 20, 2024 • edited Loading

enryH commented Mar 5, 2024

enryH commented Feb 20, 2024 •

edited

Loading