Rmmiller rescue and resolve #113

rmmiller22 · 2020-12-14T18:50:58Z

Rescue and Resolve protein inference algorithm is implemented within MetaMorpheus. Transcript CPM values are used to rescue proteins that would normally be eliminated during the parsimonious process. Unit tests are included.

PR checklist

This comment contains a description of changes (with reason)
CHANGELOG.md is updated
If you've fixed a bug or added code that should be tested, add tests!
Documentation in docs is updated

* Adds author name in README.md * Adds author name in README.md * Deletes temp file * Adds author name in README.md

basic proteogenomic object info in metamorpheus

* add author name to readme.md * add one line to refresh commit * add author name Co-authored-by: Michael Shortreed <[email protected]> Co-authored-by: cgpu <[email protected]>

* Added authorname in README Co-authored-by: cgpu <[email protected]>

* Adds Rachel Miller to the author names in the README * Minor typo Co-authored-by: cgpu <[email protected]>

Co-authored-by: cgpu <[email protected]>

* Add initial code to extract and cluster pacbio protein sequences, based on input from LR_ORFCalling * aggregation of FL and CPM by cluster Co-authored-by: Robert Millikin <[email protected]> Co-authored-by: Gloria Sheynkman <[email protected]>

* Add author name in README.md * orf calling updted to run from command line Co-authored-by: gsheynkman <[email protected]>

* Adds nf-core template for nextflow pips * Cleans up template main.nf and adds swag cli message * Updates nextflow.config * Adds Dockerfile and env yaml updates * Removes redundant files from assets * Deleted nf schema json * Removes redundant configs * Updates README with template structure * Updates docs/ * Updates repo name in changelog * Updates template test.config * Adds bin folder and template wrapper R script * Adds pbccs in env.yml * Changes the location of pipeline info, logs * Adds .github folder * Removes redundant files from GH actions * Removes AWS tests * Adds misspelling test * Removes linting.yml * Removes igenomes config * Adds tentative LICENSE (MIT) * Adds nudge for asking help via GH issues

weighted protein inference in MetaMorpheus

This adds the genomic data compilation and comparison jupyter notebook script and adds several custom module dependencies.

* update README contributions * new readme

* update README contributions * new readme * fix readme errors

MetaMorpheus: excel compatible tsv by default

…ables

* Adds author name in README.md * Adds author name in README.md * Deletes temp file * Adds author name in README.md * Modified README.md File in LR_TranscriptomeSummary * Add files via upload This adds the genomic data compilation and comparison jupyter notebook script and adds several custom module dependencies. * Update README.md * Updated version of previous files with less typos * Delete Transcriptomic_Proteomic_Comparison.ipynb * Delete m_MMprocess.py * Delete m_gen_maps.py * Delete m_make_gene_length_table.py * Delete m_sqantitable.py * Delete m_squantitable.py * Updated version with less typos * Update README.md * Preliminary module for analyzing peptide space * Add files via upload Update of peptide analysis jupyter notebook script * Convert jupyter notebook into python * Updated peptide_analysis script for review and added required files/tables * Update peptide_analysis.py * Updated .gitignore with a local data file * Updated peptide_analysis.py to include new path info * Delete gene_based_info.tsv * Delete trans_to_gene.tsv

* Add initial code to extract and cluster pacbio protein sequences, based on input from LR_ORFCalling * Started code for protein group mapping * add toy tables for the protein inference mapping * edited 6frm translate readme * delete mock files for protein inference (protein group) comparisons. Rachel and Kyndalanne have continued to work on this and these may be outdated. Co-authored-by: Robert Millikin <[email protected]> Co-authored-by: Gloria Sheynkman <[email protected]>

* Separate module for greedy protein inference * protein_inference bug fix * added rescue to greedy algorithm * connected peptides changed to set * small bug fix. cleaned up notebook

* Adds author name in README.md * Adds author name in README.md * Deletes temp file * Adds author name in README.md * Modified README.md File in LR_TranscriptomeSummary * Add files via upload This adds the genomic data compilation and comparison jupyter notebook script and adds several custom module dependencies. * Update README.md * Updated version of previous files with less typos * Delete Transcriptomic_Proteomic_Comparison.ipynb * Delete m_MMprocess.py * Delete m_gen_maps.py * Delete m_make_gene_length_table.py * Delete m_sqantitable.py * Delete m_squantitable.py * Updated version with less typos * Update README.md * Preliminary module for analyzing peptide space * Add files via upload Update of peptide analysis jupyter notebook script * Convert jupyter notebook into python * Updated peptide_analysis script for review and added required files/tables * Update peptide_analysis.py * Updated .gitignore with a local data file * Updated peptide_analysis.py to include new path info * Delete gene_based_info.tsv * Delete trans_to_gene.tsv * Removed unnecessary files from Transcriptome Module * Removed unnecessary files from Transcriptome module * Removed unnecessary files from Transcriptome module * Removed unnecessary files from Transcriptome module

…odules (#78) * Files in progress to create three modules: ReferenceTables, TranscriptomeAnalysis, PeptideAnalysis. Also, debugged orf_calling.py, found that minus strand ORFs not included. * Prepared a script that makes reference tables * Updated Transcriptomic Script * Updated Transcriptomic Script (#77) Co-authored-by: kyuubi430 <[email protected]> * Remove files for making three modules with simi. * Cleaned up referencetable module, Simi to edit. * Modified Reference Tables Script * Deleted plots. * Simi and Gloria finalized the prepare_reference_tables. Works on commandline. Correct outputs to results/PG_ReferenceTables. * Small edits to peptide_analysis, not done, push to Simi. * Modified the names out output files from Prepare Reference Tabe script * Changed file names in reference tables script and modified the transcriptome summary * Delete unneeded files in transcriptome summary module. * Finalized ReferenceTables. tested Transcriptome Summary. Started modifying the PeptideAnalysis. * Made the transcriptome summary script command line executable * Made the peptide analysis script command line runnable * In process of modifying MMprocessing script * Move scripts between TranscriptomeSummary and PeptideAnalysis modules. Code related to MM peptide/protein processing will now be exclusively in PeptideAnalysis. * Added fasta/tsv and the results directory to gitignore * Delete jurkat_orf_refined.fasta Don't want to include *fasta in pull request. * Delete genes_in_refined.tsv Don't want to include *tsv output file in PR. Added *tsv to gitignore, so shouldn't upload in future PR. Co-authored-by: kyuubi430 <[email protected]>

* Adds nf-core template for nextflow pips * Cleans up template main.nf and adds swag cli message * Updates nextflow.config * Adds Dockerfile and env yaml updates * Removes redundant files from assets * Deleted nf schema json * Removes redundant configs * Updates README with template structure * Updates docs/ * Updates repo name in changelog * Updates template test.config * Adds bin folder and template wrapper R script * Adds pbccs in env.yml * Changes the location of pipeline info, logs * Adds .github folder * Removes redendant files from GH actions * Updates CONTRIBUTING.md * Updates ISSUE_TEMPLATE * Update PULL_REQUEST_TEMPLATE.md * Removes AWS tests * Adds misspelling test * Removes linting.yml * Corrects typo * Removes igenomes config * Fixes typos caught by review-dog * Adds tentative LICENSE * Adds environment.yml with pandas, numpy, biopython * Adds CCS process * Adds pbbam (required for ccs --chunk subsequent routine) * Adds pbindex, ccs processes (w/ parallel --chunks) * Removes redundant bai (pbi is needed) * Adds temp process mock ccs and flag for testing * Deletes commented out section To respect the rule, "we do not choose to modify cod ebehaviour by commenting in and out code chunks", * Makes the section note more informative

* Adds Rachel Miller to the author names in the README * custom script for the comparison of protein group output from MetaMorpheus searches using different protein database reference models * Make protein inference analysis script command line executable * spelling fixes * Update PI_proteinInferenceAnalysis.py fix merge conflicts

modules/PG_MetaMorpheus/Test/RescueAndResolveTests.cs

kyuubi430 and others added 30 commits November 7, 2020 17:41

Adds author name in README.md

47d6782

Adds author name in README.md

5b50495

Deletes temp file

83fda18

Adds author name in README.md

20f3150

Adds Author Name in README (#15)

267424e

* Adds author name in README.md * Adds author name in README.md * Deletes temp file * Adds author name in README.md

add long read info basics

4616316

gui app manifest gitignore fix

1844fbc

Merge pull request #24 from sheynkman-lab/rmillikinBranch

2083209

basic proteogenomic object info in metamorpheus

Adds @trishorts name in README.md (#18)

a32ae36

* add author name to readme.md * add one line to refresh commit * add author name Co-authored-by: Michael Shortreed <[email protected]> Co-authored-by: cgpu <[email protected]>

Adds @gsheynkman name to README.md (#16)

8034664

* Added authorname in README Co-authored-by: cgpu <[email protected]>

Adds Rachel Miller to the author names in the README (#14)

299be93

* Adds Rachel Miller to the author names in the README * Minor typo Co-authored-by: cgpu <[email protected]>

added author and ORCID (#12)

66e67bc

Co-authored-by: cgpu <[email protected]>

Refined database orig code (#29)

b73cb45

* Add initial code to extract and cluster pacbio protein sequences, based on input from LR_ORFCalling * aggregation of FL and CPM by cluster Co-authored-by: Robert Millikin <[email protected]> Co-authored-by: Gloria Sheynkman <[email protected]>

Bj8th orf calling (#25)

e8d1f4c

* Add author name in README.md * orf calling updted to run from command line Co-authored-by: gsheynkman <[email protected]>

added pull scripts from the zenodo site

cd06b5e

weighted protein inference

9e58d65

fixes

5721d06

remove script to hopefully avoid merge conflict..

d8b8d11

update mzLib

287519c

require equal long read weight for indistinguishable proteins

e9110cd

add contrib

a308003

Modified README.md File in LR_TranscriptomeSummary

8cb21cd

Merge pull request #37 from sheynkman-lab/dev_rmillikin

6f50035

weighted protein inference in MetaMorpheus

Add files via upload

3b5598c

This adds the genomic data compilation and comparison jupyter notebook script and adds several custom module dependencies.

update main readme with author names (#38)

bb67cb4

* update README contributions * new readme

fix minor errors in readme (#40)

e20d6e9

* update README contributions * new readme * fix readme errors

excel compatible tsv by default

bf20f1d

accept thermo license by default

7ade9e8

Merge pull request #42 from sheynkman-lab/dev_rmillikin

cd500c3

MetaMorpheus: excel compatible tsv by default

kyuubi430 and others added 21 commits November 18, 2020 14:25

Updated peptide_analysis script for review and added required files/t…

670380d

…ables

Update peptide_analysis.py

d427299

Updated .gitignore with a local data file

3907188

Updated peptide_analysis.py to include new path info

ef37cab

Delete gene_based_info.tsv

d3fd727

Delete trans_to_gene.tsv

3910a8b

Protein Inference (#74)

fb85c11

* Separate module for greedy protein inference * protein_inference bug fix * added rescue to greedy algorithm * connected peptides changed to set * small bug fix. cleaned up notebook

Removed unnecessary files from Transcriptome Module

5d349e2

Removed unnecessary files from Transcriptome module

a3692bd

Removed unnecessary files from Transcriptome module

fae68bc

Removed unnecessary files from Transcriptome module

f120085

Merge branch 'kyuubi430-this-is-what-I-am-doing-dev' into dev

262ebc5

rescue algorithm implemented

a713e75

merge branches (rescue and main)

924723e

gerge branch 'main_merge' into rmmiller_rescueAndResolve

f892f5f

rmmiller22 requested review from rmillikin, bj8th and gsheynkman December 14, 2020 18:51

github-actions bot reviewed Dec 14, 2020

View reviewed changes

rmmiller22 added 3 commits December 14, 2020 13:20

eliminate added conflict files

f8a7969

config file remove

09814d0

spelling fix

65cda5f

bj8th approved these changes Dec 14, 2020

View reviewed changes

bj8th merged commit 67f0ced into main Dec 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rmmiller rescue and resolve #113

Rmmiller rescue and resolve #113

rmmiller22 commented Dec 14, 2020 •

edited

Loading

Rmmiller rescue and resolve #113

Rmmiller rescue and resolve #113

Conversation

rmmiller22 commented Dec 14, 2020 • edited Loading

PR checklist

rmmiller22 commented Dec 14, 2020 •

edited

Loading