-
Notifications
You must be signed in to change notification settings - Fork 596
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rework Hail script generation [VS-616] #8034
Conversation
Codecov Report
Additional details and impacted files@@ Coverage Diff @@
## ah_var_store #8034 +/- ##
================================================
Coverage ? 43.628%
Complexity ? 21865
================================================
Files ? 2173
Lines ? 165004
Branches ? 17792
================================================
Hits ? 71988
Misses ? 87583
Partials ? 5433 |
59e32c7
to
71ad359
Compare
import hail as hl | ||
|
||
unslashed_tmp_dir = tmp_dir if not tmp_dir.endswith('/') else tmp_dir[:-1] | ||
hl.init(tmp_dir=f'{unslashed_tmp_dir}/hail_tmp_general') |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice!
vqsr_tranche_data=args('vqsr_tranche_data'), | ||
reference_genome=rg38, | ||
final_path=vds_output_path, | ||
tmp_dir=f'{unslashed_tmp_dir}/hail_tmp_import_gvs' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is where we will put:
intermediate_resume_point=x
and also do we want to add:
partitions_per_sample=0.4,
2eae410
to
f3dc1fb
Compare
Github actions tests reported job failures from actions build 3146642435
|
1b7b427
to
926955c
Compare
Github actions tests reported job failures from actions build 3153749161
|
Splits the original script GVS to Hail import script into two smaller scripts. The first script creates the "MVP" Hail VariantDataset (VDS), the second script implements all further AC/AN/AF and VAT input processing. No Avro file literals are embedded in either script so both scripts are much smaller than the previous single script they replace.
Successful Quickstart run here: https://app.terra.bio/#workspaces/gvs-dev/mlc%20GVS%20Quickstart%20v3/job_history/2e4f192f-1042-4ee5-b1e3-3c00c97e620b
VDS generated from this script: gs://fc-a1621719-20ea-471d-a0ef-a41383dc76bd/submissions/2e4f192f-1042-4ee5-b1e3-3c00c97e620b/GvsExtractAvroFilesForHail/4124ba9e-4220-47f4-af46-8a8f1ba9ca66/call-OutputPath/2022-09-23-74fb33b4/gvs_export.vds
Tieout VCF generated from this VDS: gs://fc-a1621719-20ea-471d-a0ef-a41383dc76bd/submissions/2e4f192f-1042-4ee5-b1e3-3c00c97e620b/GvsExtractAvroFilesForHail/4124ba9e-4220-47f4-af46-8a8f1ba9ca66/call-OutputPath/2022-09-23-74fb33b4/tieout.vcf