You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It just comes to me that miniprot might be used to identify homologous TEs (based on evolutionary distant TE protein sequence) because many TEs have conserved domains (and structures) although evolved much faster than typical protein coding genes.
Because non-active (ancient) TEs can have frameshift, internal stopcodon, and insertions (from other TEs), which are similar to introns, I set -j to 0 to ignore splicing sites. However, active (more recent) TEs are usually without introns, but miniprot penalises single exons to avoid pseudogenes.
I wonder what setting should I use to avoid the single-exon penalties? And any suggestion for this task with miniprot?
The text was updated successfully, but these errors were encountered:
It just comes to me that miniprot might be used to identify homologous TEs (based on evolutionary distant TE protein sequence) because many TEs have conserved domains (and structures) although evolved much faster than typical protein coding genes.
Because non-active (ancient) TEs can have frameshift, internal stopcodon, and insertions (from other TEs), which are similar to introns, I set -j to 0 to ignore splicing sites. However, active (more recent) TEs are usually without introns, but miniprot penalises single exons to avoid pseudogenes.
I wonder what setting should I use to avoid the single-exon penalties? And any suggestion for this task with miniprot?
The text was updated successfully, but these errors were encountered: