-
Notifications
You must be signed in to change notification settings - Fork 596
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
317 remove excess header values in VCF extract #7786
Conversation
da8a97a
to
ff6a114
Compare
c547147
to
45e4187
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good.
@@ -117,6 +117,7 @@ workflows: | |||
branches: | |||
- master | |||
- ah_var_store | |||
- rc-vs-317-remove-excess-headers |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we get rid of these before we merge the PR?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ideally yes, but with tests taking so long, sometimes we dont. It would be great to do a full clean up of all the extra branches in dockstore too
@@ -15,7 +15,7 @@ workflow GvsExtractCallset { | |||
|
|||
File interval_list = "gs://gcp-public-data--broad-references/hg38/v0/wgs_calling_regions.hg38.noCentromeres.noTelomeres.interval_list" | |||
File interval_weights_bed = "gs://broad-public-datasets/gvs/weights/gvs_vet_weights_1kb.bed" | |||
File gatk_override = "gs://broad-dsp-spec-ops/scratch/bigquery-jointcalling/jars/rc-add-AD-04112022/gatk-package-4.2.0.0-498-g1f53709-SNAPSHOT-local.jar" | |||
File gatk_override = "gs://broad-dsp-spec-ops/scratch/bigquery-jointcalling/jars/rc-update-headers-04142022/gatk-package-4.2.0.0-497-gda8a97a-SNAPSHOT-local.jar" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should these directory have the jira ticket number embedded (to see if the work has been merged into ah_var_store and so we can assume the directory and jar is stale / unneeded).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah I agree that would be good. With the combo of embedding the ticket number in the branch name and the "copy your custom GATK jar" script that Kristian shared the other day we seem to get that.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I like this! Maybe post-Charlie we can consolidate a set of rules for jars? It's certainly too haphazard now
Ran this jar on the MSSNG call set and compared one of the output VCFs with that generated using the original (default) jar - differences found were as expected:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM once Travis is happy. 👍
@@ -15,7 +15,7 @@ workflow GvsExtractCallset { | |||
|
|||
File interval_list = "gs://gcp-public-data--broad-references/hg38/v0/wgs_calling_regions.hg38.noCentromeres.noTelomeres.interval_list" | |||
File interval_weights_bed = "gs://broad-public-datasets/gvs/weights/gvs_vet_weights_1kb.bed" | |||
File gatk_override = "gs://broad-dsp-spec-ops/scratch/bigquery-jointcalling/jars/rc-add-AD-04112022/gatk-package-4.2.0.0-498-g1f53709-SNAPSHOT-local.jar" | |||
File gatk_override = "gs://broad-dsp-spec-ops/scratch/bigquery-jointcalling/jars/rc-update-headers-04142022/gatk-package-4.2.0.0-497-gda8a97a-SNAPSHOT-local.jar" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah I agree that would be good. With the combo of embedding the ticket number in the branch name and the "copy your custom GATK jar" script that Kristian shared the other day we seem to get that.
Before:
![image](https://user-images.githubusercontent.com/6863459/163501604-73d5fe47-f5e3-4e9a-966b-37f99857d4e7.png)
After:
![image](https://user-images.githubusercontent.com/6863459/163501590-8770d4a2-b23c-45c3-867a-d8356023e39c.png)