-
Notifications
You must be signed in to change notification settings - Fork 596
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
VS-222 dont hard code the dataset name! #7704
Conversation
@Argument( | ||
fullName = "dataset-id", | ||
doc = "ID of the Google Cloud dataset to use when executing queries", | ||
optional = true // I guess, but wont it break otherwise or require that a dataset be created with the name temp_tables? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm going to delete this comment, but why is projectId above optional?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is there an available GATK jar with these updates to test with?
new jar here: gs://broad-dsp-spec-ops/scratch/bigquery-jointcalling/jars/rc_testing_dataset_id_20220303/gatk-package-4.2.0.0-478-g29bb3da-SNAPSHOT-local.jar I suppose I should add it to the WDL as the default |
23df84d
to
4941fc9
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 👍🏻
GvsCreateFilterSet.wdl failed recently for Morgan because of this bug. When run in a brand new project, filter model creation fails because we expect the project to have a hard coded dataset named "temp_tables" which is likely does not have. The workaround is simply to manually create one. This ticket removes the need for this dataset altogether.
This is removed, and instead, the default dataset is used (that the many other tables created in this pipeline use as the default)
able to reproduce with a dummy dataset name:
![Screen Shot 2022-03-03 at 10 44 39 PM](https://user-images.githubusercontent.com/6863459/156822409-a99d7068-169c-48a2-83ff-5bcc81cdbd2e.png)
tested here:
https://app.terra.bio/#workspaces/broad-dsp-spec-ops-fc/gvs_testing_ingest/job_history/1dd27d90-82c4-44e6-8172-15c10c8a9c7f