Add normalize_gisaid_fasta.sh for issue #53 #59

Merged
merged 6 commits into from
Mar 15, 2020

Conversation

brianpardy

Call as:

scripts/normalize_gisaid_fasta.sh data/gisaid_cov2020_sequences.fasta data/sequences.fasta 15000

Or use params.min_length when calling from Snakefile. Defaults to 15000 if unspecified.
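For readers who want a feel for the length cutoff, here is a minimal sketch of such a filter. It is not the script merged in this PR: the function name `filter_fasta_by_length` and the awk logic are assumptions made for illustration, and only the argument order (input, output, minimum length) and the 15000 default are taken from the description above. Any other normalization the real script performs is not reproduced here.

```shell
#!/usr/bin/env bash
# Hypothetical sketch, not the merged normalize_gisaid_fasta.sh.
# Keeps only FASTA records whose sequence has at least MIN_LENGTH characters.
filter_fasta_by_length() {
    local input="$1"
    local output="$2"
    local min_length="${3:-15000}"   # default value as stated in the PR description

    awk -v min="$min_length" '
        # Print the buffered record if it is long enough.
        function emit() {
            if (header != "" && length(seq) >= min) {
                print header
                print seq
            }
        }
        # A header line starts a new record: emit the previous one first.
        /^>/ { emit(); header = $0; seq = ""; next }
        # Sequence lines are joined so wrapped records are measured as a whole.
        { seq = seq $0 }
        END { emit() }
    ' "$input" > "$output"
}
```

Under these assumptions, `filter_fasta_by_length data/gisaid_cov2020_sequences.fasta data/sequences.fasta 15000` would mirror the call shown above. Each kept record is written as a header line followed by a single unwrapped sequence line.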

I hope this is proper etiquette to submit this way.

Asking snakemake for "data/sequences.fasta" was throwing an error due to an ambiguous DAG: this file can be created by either rule download or rule gisaid. There might be a better solution here, but in order to merge, I'm just going to remove this from the Snakefile.
@trvrb
Member

trvrb commented Mar 15, 2020

Hi @brianpardy. Thanks so much for this PR. I'm sorry it took so long to review. This works exactly as it should.

@trvrb trvrb merged commit 4ee884d into nextstrain:master Mar 15, 2020
oneillkza added a commit to oneillkza/ncov that referenced this pull request Mar 31, 2020
Add detailed steps for how to obtain and normalize GISAID fasta, as resolved in nextstrain#53 and nextstrain#59.