Trimming Suggestion for 16S V3 #1791

Jiayonglai123 · 2023-08-02T02:41:57Z

Hi, I'm a newbie in dada2 pipeline,
my samples are from Illumina Miseq 100, 16S V3 in casava single end so no merging is required,
my samples are 16S V3 in casava single end so no merging is required,
primers were cut using cutadapt before going into dada2 for quality check and downtrim analysis.
I read through some issues posted in this github, can I get some suggestion for the result?

should I cut the length in according to each sample or should I find the best cut for all?
can I ask about the black/ grey colored part means?
I try cutting it at 160bp, but in error rate seems not so good, should I try lower value to trim( around 135bp?)
for plotQualityProfile() can I do something like a interactive ploty so that I can cut the best quality for all samples.
Thanks beforehand

benjjneb · 2023-08-03T01:26:58Z

Same for all
It is the heatmap of the quality scores. The plots you posted strongly suggest binned quality scores. How sure are you that this sequencing was done on a MiSeq? (vs. a MiniSeq or NovaSeq etc)
truncLen will throw away all reads that don't reach the truncation length, and your pre-processing has resulted in a majority of reads not being 160nts long (see the red line -- the cumulative distribution of read lengths). Given your preprocessing, only truncLen shorter than the first drop around 135 seem appropriate.
We don't have interactive plots implemented in the package.

Jiayonglai123 · 2023-08-04T01:45:39Z

Thanks for your prompt reply, and good insights,

about the heatmap, does is meant that the darker the map is the higher the quality of reads I got?
I just checked with my sequencing provider, it's Illumina's iSeq 100 (thanks for pointing out my mistake)
really apprieciate your time and efforts to answer all the questions in the forum with quick reply speed.
I just tried out the error rate estimation, while it is not as nice as shown in the tutorial, witht truncate of 135 it has less error out at the bottom of the plot, do you think it is good to proceed with the next step with this plot?

benjjneb · 2023-08-04T16:57:34Z

the darker the map is the higher the quality of reads I got?

The darker the cell in the heatmap, the more reads had that quality at that position.

I just tried out the error rate estimation, while it is not as nice as shown in the tutorial, witht truncate of 135 it has less error out at the bottom of the plot, do you think it is good to proceed with the next step with this plot?

You are OK to proceed. The weird quality score fitting with binned Q scores is a known issue, but doesn't seem to affect operation of dada2 much. You can read more: #791

Jiayonglai123 · 2023-08-06T03:30:03Z

Thanks for your help and support.
Sincere gratitute for your prompt and quick reply.

Jiayonglai123 closed this as completed Aug 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Trimming Suggestion for 16S V3 #1791

Trimming Suggestion for 16S V3 #1791

Jiayonglai123 commented Aug 2, 2023

benjjneb commented Aug 3, 2023

Jiayonglai123 commented Aug 4, 2023 •

edited

Loading

benjjneb commented Aug 4, 2023

Jiayonglai123 commented Aug 6, 2023

Trimming Suggestion for 16S V3 #1791

Trimming Suggestion for 16S V3 #1791

Comments

Jiayonglai123 commented Aug 2, 2023

benjjneb commented Aug 3, 2023

Jiayonglai123 commented Aug 4, 2023 • edited Loading

benjjneb commented Aug 4, 2023

Jiayonglai123 commented Aug 6, 2023

Jiayonglai123 commented Aug 4, 2023 •

edited

Loading