-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merge external splicing counts #247
Changes from 67 commits
4b6b0bf
e0e5844
7e59764
434135d
d353a56
2712658
f85e130
91566f6
0cf5832
b1be9d3
6e0467c
48a0ab2
b86f008
971a401
fdfd3cd
39744c9
d7e0894
2f81989
d1f60cd
ce0a75f
a5f8de0
4dbcf0e
5c40c88
fa12be8
ab7598f
4b385c3
9d68b2f
a079606
f98ca7c
39b5590
1d3d994
f6ea598
41b9d21
f76b741
ab4545b
d83ec49
6505a65
83ca561
cd5f487
16ef35c
82870ac
c9f3585
3f23198
5bf0d44
14edfc4
27549c9
cfb8309
0e970ee
d298002
5c038b9
b25a9d4
aa9dd7b
0141dc8
c100b74
9e9d909
0eb78fc
ef65020
0973a53
c363c11
59c4d2a
4c83f2d
420d31c
2322f5f
eae85db
46df827
4264ff0
9daef0e
a04da62
5917caa
2d220f8
ea1b26d
d15a702
1a628cb
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change | ||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
@@ -0,0 +1,129 @@ | ||||||||||||||||||||||||||
Results and Output of DROP | ||||||||||||||||||||||||||
=========================== | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
DROP is intended to help researchers use RNA-Seq data in order to detect genes with aberrant expression, | ||||||||||||||||||||||||||
aberrant splicing and mono-allelic expression. By simplifying the workflow process we hope to provide | ||||||||||||||||||||||||||
easy to read and interpret html files and output files. This section is dedicated to explaining the relevant | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
results files. We will use the results of the ``demo`` to explain the files generated.:: | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
#install drop | ||||||||||||||||||||||||||
mamba create -n drop_env -c conda-forge -c bioconda drop | ||||||||||||||||||||||||||
conda activate drop_env | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
mkdir drop_demo | ||||||||||||||||||||||||||
cd drop_demo | ||||||||||||||||||||||||||
drop demo | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
snakemake -c1 | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
Aberrant Expression | ||||||||||||||||||||||||||
+++++++++++++++++++ | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
html file | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
######### | ||||||||||||||||||||||||||
Looking at the resulting ``Output/html/drop_demo_index.html`` we can see the ``AberrantExpression`` | ||||||||||||||||||||||||||
tab at the top of the screen. Following that the Overview tab contains links to the: | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
* Counting Summaries | ||||||||||||||||||||||||||
* For each aberrant expression group | ||||||||||||||||||||||||||
* split of local vs external sample counts | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
* QC relating to reads and size factors for each sample | ||||||||||||||||||||||||||
* histograms relating to mean count distribution with different conditions | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
* information about the expressed genes within each sample and as a dataset | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
* Outrider Summaries | ||||||||||||||||||||||||||
* For each aberrant expression group | ||||||||||||||||||||||||||
* the number of aberrantly expressed gene per sample | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
* how batch correction is done and the resulting lack of batch effects | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
* which samples contain outliers | ||||||||||||||||||||||||||
* results table | ||||||||||||||||||||||||||
* Files | ||||||||||||||||||||||||||
* OUTRIDER files for each aberrant expression group | ||||||||||||||||||||||||||
* For each of these files you can follow the `OUTRIDER vignette for individual analysis <https://www.bioconductor.org/packages/devel/bioc/vignettes/OUTRIDER/inst/doc/OUTRIDER.pdf>`_. | ||||||||||||||||||||||||||
* tsv files | ||||||||||||||||||||||||||
* For each aberrant expression group | ||||||||||||||||||||||||||
* results.tsv | ||||||||||||||||||||||||||
* this tsv file contains only the significant genes and samples that meet the cutoffs defined in the ``config.yaml`` for ``padjCutoff`` and ``zScoreCutoff`` | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
Local result files | ||||||||||||||||||||||||||
################## | ||||||||||||||||||||||||||
Additionally the ``aberrantExpression`` module creates the file ``Output/processed_results/aberrant_expression/{annotation}/outrider/{drop_group}/OUTRIDER_results_all.Rds`` this file is the Rds object containing the entire OUTRIDER results table regardless of significance. | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
Aberrant Splicing | ||||||||||||||||||||||||||
+++++++++++++++++ | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
html file | ||||||||||||||||||||||||||
########## | ||||||||||||||||||||||||||
Looking at the resulting ``Output/html/drop_demo_index.html`` we can see the ``AberrantSplicing`` | ||||||||||||||||||||||||||
tab at the top of the screen. Following that the Overview tab contains links to the: | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
* Counting Summaries | ||||||||||||||||||||||||||
* For each aberrant splicing group | ||||||||||||||||||||||||||
* split of local (from internal BAM files) vs external sample counts | ||||||||||||||||||||||||||
* split of local vs merged with external sample splicing/intron counts | ||||||||||||||||||||||||||
* comparison of local and external log mean counts | ||||||||||||||||||||||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||||||||||||||||||||||
* histograms relating to junction expression before and after filtering and variability | ||||||||||||||||||||||||||
* FRASER Summaries | ||||||||||||||||||||||||||
* For each aberrant splicing group | ||||||||||||||||||||||||||
* the number of samples, introns, and splice sites | ||||||||||||||||||||||||||
* how batch correction is done and the resulting lack of batch effects | ||||||||||||||||||||||||||
* result table | ||||||||||||||||||||||||||
* Files | ||||||||||||||||||||||||||
* FRASER files for each aberrant splicing group | ||||||||||||||||||||||||||
* For each of these files you can follow the `FRASER vignette for individual analysis <https://www.bioconductor.org/packages/devel/bioc/vignettes/FRASER/inst/doc/FRASER.pdf>`_. | ||||||||||||||||||||||||||
* tsv files | ||||||||||||||||||||||||||
* For each aberrant splicing group | ||||||||||||||||||||||||||
* results_per_junction.tsv | ||||||||||||||||||||||||||
* this tsv file contains only significant junctions that meet the cutoffs defined in the ``config.yaml`` they are aggregated at the junction level. | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
Local result files | ||||||||||||||||||||||||||
################## | ||||||||||||||||||||||||||
Additionally the ``aberrantSplicing`` module creates the following file ``Output/processed_results/aberrant_splicing/results/{annotation}/fraser/{drop_group}/results.tsv``. | ||||||||||||||||||||||||||
This tsv file contains only significant junctions that meet the cutoffs defined in the ``config.yaml`` they are aggregated at the gene level. Any sample/gene pair is represented by only the most significant junction. | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
Mono-allelic Expression | ||||||||||||||||||||||||||
+++++++++++++++++++++++ | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
html file | ||||||||||||||||||||||||||
########## | ||||||||||||||||||||||||||
Looking at the resulting ``Output/html/drop_demo_index.html`` we can see the ``MonoallelicExpression`` | ||||||||||||||||||||||||||
tab at the top of the screen. Following that the Overview tab contains links to the: | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
* Results | ||||||||||||||||||||||||||
* For each mae group | ||||||||||||||||||||||||||
* the number of samples, unique genes, and aberrant events | ||||||||||||||||||||||||||
* a cascade plot that shows additional filters | ||||||||||||||||||||||||||
* MAE for REF: the monoallelic expression favors the reference allele | ||||||||||||||||||||||||||
* MAE for ALT: the monoallelic expression favors the alternative allele | ||||||||||||||||||||||||||
* rare: | ||||||||||||||||||||||||||
* if ``add_AF`` is set to true in ``config.yaml`` must meet minimum AF set by ``max_AF`` | ||||||||||||||||||||||||||
* additionally it must meet the inner-cohort frequency ``maxVarFreqCohort`` cutoff | ||||||||||||||||||||||||||
* histogram of inner cohort frequency | ||||||||||||||||||||||||||
* summary of cascade plots and results table | ||||||||||||||||||||||||||
* Files | ||||||||||||||||||||||||||
* Allelic counts | ||||||||||||||||||||||||||
* a directory containing the allelic counts of heterozygous variants | ||||||||||||||||||||||||||
* Results data tables of each sample (.Rds) | ||||||||||||||||||||||||||
* Rds objects containing the full results table regardless of MAE status | ||||||||||||||||||||||||||
* Significant MAE results tables | ||||||||||||||||||||||||||
* For each mae group | ||||||||||||||||||||||||||
* a link to the results tsv file. | ||||||||||||||||||||||||||
* Only contains significant MAE results based on ``config.yaml`` cutoffs for the alternative allele | ||||||||||||||||||||||||||
* Quality Control | ||||||||||||||||||||||||||
* QC Overview | ||||||||||||||||||||||||||
* For each mae group QC checks for DNA/RNA matching | ||||||||||||||||||||||||||
* Analyze Individual Results | ||||||||||||||||||||||||||
* An example analaysis that can be run using the Rds objects linked in the files subsection | ||||||||||||||||||||||||||
* performed on the first mae sample | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
Local result files | ||||||||||||||||||||||||||
################## | ||||||||||||||||||||||||||
Additionally the ``mae`` module creates the following files: | ||||||||||||||||||||||||||
|
||||||||||||||||||||||||||
* ``Output/processed_results/mae/{drop_group}/MAE_results_all_v29.tsv.gz`` | ||||||||||||||||||||||||||
* this file is the tsv results of all heterozygous variants regardless of significance | ||||||||||||||||||||||||||
* ``Output/processed_results/mae/{drop_group}/MAE_results_v29.tsv`` | ||||||||||||||||||||||||||
* this is the file linked in the html document and described above | ||||||||||||||||||||||||||
* ``Output/processed_results/mae/{drop_group}/MAE_results_v29_rare.tsv`` | ||||||||||||||||||||||||||
* this file is the subsetted tsv of ``MAE_results_v29.tsv`` with only the variants that pass the rare cutoffs | ||||||||||||||||||||||||||
* if ``add_AF`` is set to true in ``config.yaml`` must meet minimum AF set by ``max_AF`` | ||||||||||||||||||||||||||
* inner-cohort frequency must meet ``maxVarFreqCohort`` cutoff |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
follow slack comments