-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No flaA gene in blast results in version 1.18.0 #17
Comments
Thanks for raising this! Looks like this was a BLAST issue relating to the size of the SBT database after we updated it with new sequences. We're now right on the threshold where the max_target_seqs setting of blastn causes issues and I didn't catch it in prerelease testing. You can see the issue if you blast the assembly you are using against the sbt database: $ blastn -subject db/all_loci.fasta -query GCF_900119765.1_2532STDY5467631_genomic.fna -outfmt 6 | grep -c fla
0
$ blastn -subject db/all_loci.fasta -query GCF_900119765.1_2532STDY5467631_genomic.fna -outfmt 6 -max_target_seqs 50000 | grep -c fla
44 The default for max_target_seqs is 500. It looks like we get all the hits if we bump it to 545. I've increased it to 50,000 to future-proof against future DB updates and custom DBs. I pushed a fix to the main branch which should update on bioconda soon. |
Thank you very much. It looks like version 1.18.2 works as expected.
|
Hi,
I faced a problem while I was testing the version 1.18.0. el_gato cannot find ST since it cannot find the flaA gene in the blast. Tested genome is GCF_900119765.1
The text was updated successfully, but these errors were encountered: