Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Apollo generates the gff3 with boundaries for the longer feature when two overlap in same reading frame #10

Open
jrr-cpt opened this issue May 29, 2019 · 1 comment
Labels

Comments

@jrr-cpt
Copy link

jrr-cpt commented May 29, 2019

@MoffMade @BeaverThing Noticed an error in the coordinates that are generated in the gff3 for a feature when two genes that are in the same reading frame are called. Usually this happens when we are comparing the possible lengths for a particular gene: we call both, check the nucleotides in the start and SD, then delete one. What I noticed in the Spanins Test Genome organism is that if you called the longer possible ORF first, then ended up deleting it, the shorter feature still had the gene coordinates of the longer feature, though child coordinates are correct. I can reproduce this error by simple order of calling and deleting.

For example, I noticed that the output of this OSP tool in the evidence track was extending the feature out in the 5' direction from the start.
Screen Shot 2019-05-29 at 15 08 08
This was actually correct because the gff3 for the feature had the longer coordinates:
Screen Shot 2019-05-29 at 15 07 59
And by deleting the gene, then calling the MGA (light blue) feature, followed by the Glimmer (dark blue/shorter) feature, both features end up with the same gene coordinates. Then it doesn't matter which gets deleted, because the coordinates are already set.
Screen Shot 2019-05-29 at 15 08 19

Perhaps a good question for Apollo/Nathan?

@jrr-cpt jrr-cpt added the bug label May 29, 2019
@MoffMade
Copy link

As discussed in the meeting earlier, lets put this on hold and test similar situations in the new Apollo version, it may have been handled in the updates.

For now, it looks like you can "reset" the coordinates by selecting the problem feature and hitting one of the "Set translaition start" options in the rightclick menu and then undoing that action. It seems to be due to overlapping gene feature calls in the same reading frame

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants