Pre.cluster parameter diff tuning

Ray · May 16, 2022, 9:25pm

Hi all,

I’m using default diff=2 for pre.cluster command. However, there are only a small reduction in number of unique reads (2.2 million to 2 million). Since there are so many unique reads, should I set a larger diffs, e.g., diffs =2 or diffs = 5 to reduce the number of unique reads. My data is from V3 region and I’ve customized alignment of silva to V3 region.

Thanks,
Ray

pschloss · May 17, 2022, 5:30pm

The rule of thumb that I recommend is 1 diff per 100 nt of sequence data. If you are sequencing the V3 region, that is ~195 nt (Customize your reference alignment for your favorite region). So I would not use more than diffs=2

Pat

system · May 27, 2022, 5:30pm

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Pre.cluster diffs option Commands in mothur	1	1346	June 16, 2015
Diffs value for pre.cluster Theory behind mothur	2	670	June 26, 2020
Pre.cluster Commands in mothur	3	3149	July 30, 2012
Pre.cluster diff number setting based on sequence lenght Commands in mothur	3	755	March 9, 2020
pre.cluster to denoise mothur bugs	1	3584	July 30, 2012

Pre.cluster parameter diff tuning

Related topics