Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add comp #57

Merged
merged 13 commits into from
Mar 5, 2024
Merged

Add comp #57

merged 13 commits into from
Mar 5, 2024

Conversation

enryH
Copy link
Member

@enryH enryH commented Feb 20, 2024

Streamline comparison and add new ways of sampling simulated missing values.

fixed that DAE and VAE can also be trained without a validation split
cleaned up model classes, started to get ride of PeptideAnalyzer (also moving code)
no dependency on Thermo specific metadata (basic quality control by number of features in sample)

Henry added 7 commits February 19, 2024 18:04
- start to get rid of AnalyzePeptides class
- base filtering on number of quantified samples (sample completness)
- 🔥 remove old funtions which are obsolete
- 🎨 rename fake_na to simulated_na (to do everywhere)
- clean-up fastai imports and assign parts explicitly for linting
- 🔥 remove one subclassing level, commented code
- 🎨 rename fake to simulated NA and clean-up notebook
- if there will be a command line interface, then it will be defined in __main__.py
- 🎨 remove one level of inheritence
- 🎨 rename fake to simulated NA
Idea: Only use a subset of samples to generate simulated validation
and test data from. Performance is then split by validation and test samples, although the remaining data can be used at training time.
(will be compared to setup where simulated missing values are sampled from all samples)
@enryH enryH force-pushed the add_comp branch 2 times, most recently from ebb42d1 to 651f6bc Compare February 23, 2024 13:11
- fastai routine needs always a validation dataset, but it can be empty.
@enryH enryH marked this pull request as ready for review March 5, 2024 18:19
@enryH
Copy link
Member Author

enryH commented Mar 5, 2024

  • fixed that DAE and VAE can also be trained without a validation split
  • cleaned up model classes, started to get ride of PeptideAnalyzer (also moving code)
  • no dependency on Thermo specific metadata (basic quality control by number of features in sample)

@enryH enryH merged commit 86aa007 into dev Mar 5, 2024
7 checks passed
@enryH enryH deleted the add_comp branch March 5, 2024 18:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant