Would this pipeline also be perfectly suitable for Illumina PE250bp reads? #8

Open
shanexuuu opened this issue Jul 15, 2024 · 1 comment

Comments

@shanexuuu

Hi,

I am using this pipeline to reanalyze some data from an Illumina NovaSeq PE250bp run.

I noticed that in this paper, the samples were sequenced on a HiSeq with PE250bp reads (McNichol, J., Berube, P., Biller, S., Fuhrman, J., 2021. [Evaluating and Improving SSU rRNA PCR Primer Coverage for Bacteria, Archaea, and Eukaryotes Using Metagenomes from Global Ocean Surveys](https://journals.asm.org/doi/10.1128/mSystems.00565-21). mSystems. 6(3), e00565-21).

And in this paper, the samples were sequenced on a MiSeq with PE300bp reads (Yeh, Y.C., McNichol, J., Needham, D., Fichot, E., Berdjeb, L., Fuhrman, J., 2021. Comprehensive single-PCR 16S and 18S rRNA community analysis validated with mock communities, and estimation of sequencing bias against 18S. Environmental Microbiology. doi: 10.1111/1462-2920.15553).

Do you think PE250 reads would also work well with this pipeline?

Many thanks!
Shane

@jcmcnch
Owner

jcmcnch commented Jul 28, 2024

Hi Shane,

Sorry for the delay in getting back to you. To answer your question: yes, there is no reason why the pipeline shouldn't work with PE250. In fact, in our experience PE250 generally gives better overall quality than PE300.

Note, though, that there are some potential issues with NovaSeq regarding error model training, caused by changes in the way quality scores are reported compared with the old Illumina chemistry (see: benjjneb/dada2#791). There is also an increased chance of index hopping with NovaSeq compared with the MiSeq and the old HiSeq that we used (i.e. patterned flow cell vs. bridge amplification; see: https://www.illumina.com/techniques/sequencing/ngs-library-prep/multiplexing/index-hopping.html). You should therefore verify that your libraries use a UDI (unique dual index) barcoding approach; otherwise you will see ASVs bleeding through from sample to sample. I've actually not used NovaSeq data successfully myself due to these issues, so I am not up to date on the current concerns. I'd suggest asking around and checking the relevant threads on dada2's GitHub, the QIIME 2 forum, etc.
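As a rough way to check whether a given run is affected by the quality-score reporting change, one can count how many distinct quality values appear in a FASTQ file; binned data shows only a handful. The snippet below is a minimal sketch and not part of the pipeline. It assumes Phred+33 encoding and four-line FASTQ records, and the names `count_quality_values` and `open_fastq` are hypothetical helpers made up for illustration. The in-memory reads are toy data, not real output.

```python
import gzip
import io
from collections import Counter


def count_quality_values(handle, max_reads=10000):
    """Count Phred scores in the first max_reads records (assumes Phred+33)."""
    counts = Counter()
    for i, line in enumerate(handle):
        if i // 4 >= max_reads:
            break
        if i % 4 == 3:  # the fourth line of each FASTQ record holds the qualities
            counts.update(ord(ch) - 33 for ch in line.rstrip("\n"))
    return counts


def open_fastq(path):
    """Open a plain or gzipped FASTQ file as text (hypothetical helper)."""
    return gzip.open(path, "rt") if path.endswith(".gz") else open(path)


# Toy in-memory example with two reads; replace with open_fastq("your_R1.fastq.gz")
example = io.StringIO("@r1\nACGT\n+\nFF:,\n@r2\nACGT\n+\nF#FF\n")
counts = count_quality_values(example)
distinct = sorted(counts)
print(f"{len(distinct)} distinct quality values: {distinct}")
```

If a real file reports only a few distinct values across thousands of reads, the scores are most likely binned, and the discussion in the dada2 thread linked above is relevant to that data.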

Hope that helps,
Jesse
