-
Notifications
You must be signed in to change notification settings - Fork 192
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MSG: Got a sequence without letters. Could not guess alphabet #127
Comments
Hi, |
Hi Andrew, |
Thanks for that, |
Hi Mushal, |
Many thanks @andrewjpage its working well :) |
Thanks for letting me know. |
Hi @andrewjpage I'm using roary 3.13.0 and have this problem:
|
I think the current version of roary is 3.14.0 |
Thanks for your response,
And what is the best approach to the warning, identify and remove the
sequences with many Ns? What is considered an acceptable threshold for Ns?
Kind regards.
…On Sat, Feb 29, 2020 at 3:38 PM Torsten Seemann ***@***.***> wrote:
I think the current version of roary is 3.14.0
That warning comes from bioperl and it usually means you have lots of -
or N letters in your sequence.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#127?email_source=notifications&email_token=ABGBQRXUI5V5I2TX6URMSWDRFGOAHA5CNFSM4BFPFLP2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOENMI5GI#issuecomment-593006233>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABGBQRSY5S3ELPE3JEORXBDRFGOAHANCNFSM4BFPFLPQ>
.
--
San Emmanuel James
Skype: jsan4christ
Mobile: UG +256752900304, SA +27 67 833 1444
----------------------------------------------------------------------------
The Lord is my shepherd, I shall not want! Psalms 23
|
I'm having trouble locating version 3.14.0 installer,
Thanks
On Sat, Feb 29, 2020 at 5:07 PM San Emmanuel James <
[email protected]> wrote:
… Thanks for your response,
And what is the best approach to the warning, identify and remove the
sequences with many Ns? What is considered an acceptable threshold for Ns?
Kind regards.
On Sat, Feb 29, 2020 at 3:38 PM Torsten Seemann ***@***.***>
wrote:
> I think the current version of roary is 3.14.0
> That warning comes from bioperl and it usually means you have lots of -
> or N letters in your sequence.
>
> —
> You are receiving this because you commented.
> Reply to this email directly, view it on GitHub
> <#127?email_source=notifications&email_token=ABGBQRXUI5V5I2TX6URMSWDRFGOAHA5CNFSM4BFPFLP2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOENMI5GI#issuecomment-593006233>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/ABGBQRSY5S3ELPE3JEORXBDRFGOAHANCNFSM4BFPFLPQ>
> .
>
--
San Emmanuel James
Skype: jsan4christ
Mobile: UG +256752900304, SA +27 67 833 1444
----------------------------------------------------------------------------
The Lord is my shepherd, I shall not want! Psalms 23
--
San Emmanuel James
Skype: jsan4christ
Mobile: UG +256752900304, SA +27 67 833 1444
----------------------------------------------------------------------------
The Lord is my shepherd, I shall not want! Psalms 23
|
Hi! I would also like to revive this issue. I am running roary (3.12.0) on a dataset consisting of 2170 bacterial genomes. The command I ran was the following:
The process runs seemingly fine and the correct output files are generated but I get the following error message twice
Also the core_gene_alignment.aln file is very small (seems to consist only of one gene or so) despite the summary statistics file stating that there should be 1141 core genes. I have previously tried to QC my genomes by running sendsketch and validated with Kraken on dubious ones. I also made a mash tree to double check and remove outliers and by removing assemblies with over 200 contigs. After this I used prokka v 1.12 for annotation (I know it's an old version). Is this error message due to low quality/high divergence among the genomes as suggested by some answers I have found or N's in the sequences or what do you think? And most importantly; how can I mitigate it? I visualised the nwk and gene_presence_absence.csv file in Phandango and I cannot see any genome behaving weirdly (eg containing very few core genes/being very divergent from the others) there. Thank you for your help! |
To follow up on this, is there anyway to identify the sequences that give rise to this error and modify them/exclude them? Since the error message was repeated twice I assume they are two? @andrewjpage @tseemann |
I had this problem too, but mine was caused by a roary version issue. When I initially installed conda, I didn't add a new channel to the conda config, which caused me to use:
and then use |
Hi
I got this error when I try to create a core alignment
Thanks
The text was updated successfully, but these errors were encountered: