This PR prepares `sarscov2_illumina_full` for use by the workflow launcher (WFL) in Terra, running off the Terra Data Repo. The main requirement is that the bulk of the compute happens in a single automated all-in-one workflow that can be launched using only the data coming from the data repo plus any invariant workspace variables. This PR accomplishes this by incorporating `sarscov2_biosample_load` as a subworkflow at the beginning of `sarscov2_illumina_full`. This invocation is optional and only happens if the required `Array[File] biosample_attributes` variable is an empty list. An empty list is interpreted to mean that BioSamples have not been registered and that the attributes table needs to be generated by calling `sarscov2_biosample_load`. The rest of the workflow then proceeds as usual.

After merging this PR, WFL will be able to automate the vast majority of compute after sequencing is complete: BioSample registration, demux, depletion, assembly, lineage calling, and preparation of all submission packages (GenBank, SRA, GISAID). Users would then need to follow this run with `sarscov2_sequencing_reports` to generate a cumulative metadata TSV, and then `sarscov2_data_release` if all looks good (the cumulative metadata TSV is required for delivery to CDC OAMD, but not to NCBI or GISAID).
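
For reviewers who want the branching rule spelled out, the optional BioSample step described above can be modeled roughly as follows. This is a Python illustration of the decision logic only, not the WDL in this PR: `resolve_biosample_attributes`, `fake_registration`, and the file names are hypothetical stand-ins, and the real subworkflow's inputs and outputs are not shown here.

```python
from typing import Callable, List


def resolve_biosample_attributes(
    biosample_attributes: List[str],
    register_biosamples: Callable[[], List[str]],
) -> List[str]:
    """Return the attribute tables the rest of the workflow should use.

    `register_biosamples` is a hypothetical stand-in for calling the
    sarscov2_biosample_load subworkflow.
    """
    if len(biosample_attributes) == 0:
        # Empty list: BioSamples are taken to be unregistered, so the
        # registration step runs and produces the attributes table.
        return register_biosamples()
    # Non-empty list: attributes already exist, so registration is skipped.
    return list(biosample_attributes)


# Record which (stand-in) subworkflow calls were made.
calls: List[str] = []


def fake_registration() -> List[str]:
    # Hypothetical placeholder; the real step registers BioSamples.
    calls.append("sarscov2_biosample_load")
    return ["generated_attributes.tsv"]


fresh = resolve_biosample_attributes([], fake_registration)
existing = resolve_biosample_attributes(
    ["existing_attributes.tsv"], fake_registration
)
print(fresh, existing, calls)
```

In this model the registration stand-in runs once, for the empty-list case only, which mirrors the intent that previously registered batches pass through unchanged.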