setting the clustering cutoff

hkallen · November 28, 2011, 7:16pm

Hello. When using the cluster command, mothur is changing the cutoff value to suit her needs. For example, I ran dist.seqs with cutoff=0.10, but when I ran cluster, mothur changed the cutoff to ~0.05.

mothur > cluster(column=AllButInt25.final.dist, name=AllButInt25.final.names)
********************#****#****#****#****#****#****#****#****#****#****#
Reading matrix:     |||||||||||||||||||||||||||||||||||||||||||||||||||
***********************************************************************
changed cutoff to 0.0475441

Output File Names: 
AllButInt25.final.an.sabund
AllButInt25.final.an.rabund
AllButInt25.final.an.list

It took 1050 seconds to cluster

Why does mothur do this, or how can I prevent mothur from doing this? Thanks a lot!
Heather

westcott · November 29, 2011, 11:58am

This is one of our most common questions. Here’s Pat’s explanation: “This is a product of using the average neighbor algorithm with a sparse distance matrix. When you run cluster, the algorithm looks for pairs of sequences to merge in the rows and columns that are getting merged together. Let’s say you set the cutoff to 0.05. If one cell has a distance of 0.03 and the cell it is getting merged with has a distance above 0.05 then the cutoff is reset to 0.03, because it’s not possible to merge at a higher level and keep all the data. All of the sequences are still there from multiple phyla. Incidentally, although we always see this, it is a bigger problem for people that include sequences that do not fully overlap.” I would recommend increasing the cutoff to 0.25.
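To make the explanation above a bit more concrete, here is a toy Python sketch of the situation Pat describes. It is not mothur's actual code: the function name `merge_pair`, the dictionary-based sparse matrix, and the equal-cluster-size averaging are all assumptions made for illustration. It only mimics the rule as stated in the post, where a missing distance forces the cutoff down to the distance that is known.

```python
def merge_pair(sparse, a, b, others, cutoff):
    """Toy average-neighbor merge of clusters a and b over a sparse matrix.

    sparse maps frozenset({x, y}) -> distance; pairs above the original
    cutoff are simply absent. Returns (distances to merged cluster, cutoff).
    Hypothetical helper for illustration, not part of mothur.
    """
    merged = {}
    for k in others:
        d_ak = sparse.get(frozenset((a, k)))
        d_bk = sparse.get(frozenset((b, k)))
        if d_ak is not None and d_bk is not None:
            # Both distances are stored, so the average can be computed.
            # (Equal cluster sizes are assumed to keep the toy simple.)
            merged[k] = (d_ak + d_bk) / 2
        elif d_ak is not None or d_bk is not None:
            # One distance was dropped from the matrix, so no average exists.
            # Following the post, the cutoff falls to the known distance.
            known = d_ak if d_ak is not None else d_bk
            cutoff = min(cutoff, known)
    return merged, cutoff


# A-B and A-C are stored; B-C was above 0.05 and never written to the file.
sparse = {frozenset("AB"): 0.01, frozenset("AC"): 0.03}
merged, new_cutoff = merge_pair(sparse, "A", "B", ["C"], cutoff=0.05)
print(merged, new_cutoff)
```

With the numbers from the post, merging A and B leaves no usable distance to C, and the cutoff drops from 0.05 to 0.03. Starting from a larger cutoff keeps more cells in the matrix, so fewer merges hit a missing distance.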
