Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Suggestions of input datasets I used for braker2/3 #885

Open
hungweichen0327 opened this issue Nov 19, 2024 · 2 comments
Open

Suggestions of input datasets I used for braker2/3 #885

hungweichen0327 opened this issue Nov 19, 2024 · 2 comments

Comments

@hungweichen0327
Copy link

hungweichen0327 commented Nov 19, 2024

Dear community,

I used braker2 and braker3 respectively to do gene prediction in a tree genome (genome size ~900Mb).

The input data:

  • RNA-seq: RNA extracted from the leaves of the targeted species.
  • Protein database: orthoDB11- Viridiplantae

The gene number and BUSCO result is shown below:
image

Based on the result, the number of genes in the braker.gtf from braker3 is quite less (25943) than braker2 (48396). I know that braker3 only contained those transcripts in the result with very high support by the RNA-Seq and protein evidence. Thus, it's normal that the gene number in braker3 is lower than that in braker2.

However, the gene number in the phylogenetically closed related species is about 40000-50000 genes.

I regarded that the lower number of genes in braker3 is probably related to the RNA-seq data. We only have the RNA-seq from leaves. It's common for people to use RNA-seq from leaves, roots, flowers, and seeds. But it's difficult for us to obtain the RNA-seq from other tissues in addition to the leaves. Do you have any suggestions how to improve the result?

Thank you.

@bijendrabio
Copy link

bijendrabio commented Dec 2, 2024

Similar issue and I am getting few genes despite using same number of RNASeq data: #894

@Terives
Copy link

Terives commented Dec 12, 2024

I had the same issue with fungal genomes

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants