Cluster()--explanation for clustering methods

akoid · February 15, 2011, 5:12pm

Hi,

None of the links for any of the clustering methods have text. If I am not mistaken, I have read a page with explanations for the different methods. Is that something that can be put back on fairly easily?

Thanks,
Amy

wuyun · July 8, 2011, 2:47pm

You can just google it; here’s some information that might be helpful:
Nearest neighbor:

Furthest neighbor:

However, I don’t know how the average neighbor is calculated in mothur. I hope someone else can explain the average neighbor algorithm used here.

pschloss · July 8, 2011, 7:17pm

It’s also known as the UPGMA algorithm.
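
For readers comparing the three methods named in this thread, here is a minimal Python sketch of how nearest, furthest, and average (UPGMA) linkage differ. It is a textbook illustration, not mothur's actual code: the function names `cluster_distance` and `cluster`, the toy distance matrix, and the stopping rule (stop once the closest pair of clusters is farther apart than the cutoff) are all assumptions made for the example.

```python
from itertools import combinations


def cluster_distance(dist, a, b, method):
    """Distance between clusters a and b (tuples of sequence indices)."""
    pairwise = [dist[i][j] for i in a for j in b]
    if method == "nearest":    # single linkage: closest pair of members
        return min(pairwise)
    if method == "furthest":   # complete linkage: most distant pair of members
        return max(pairwise)
    if method == "average":    # UPGMA: mean over all between-cluster pairs
        return sum(pairwise) / len(pairwise)
    raise ValueError(f"unknown method: {method}")


def cluster(dist, cutoff, method="average"):
    """Merge the closest clusters until none are within the cutoff."""
    clusters = [(i,) for i in range(len(dist))]
    while len(clusters) > 1:
        d, a, b = min(
            (cluster_distance(dist, a, b, method), a, b)
            for a, b in combinations(clusters, 2)
        )
        if d > cutoff:
            break
        clusters = [c for c in clusters if c not in (a, b)] + [a + b]
    return sorted(sorted(c) for c in clusters)


# Hypothetical pairwise distances between four sequences.
toy = [
    [0.00, 0.02, 0.04, 0.30],
    [0.02, 0.00, 0.02, 0.30],
    [0.04, 0.02, 0.00, 0.30],
    [0.30, 0.30, 0.30, 0.00],
]

for method in ("nearest", "average", "furthest"):
    print(method, cluster(toy, 0.035, method))
```

On this toy matrix, furthest neighbor keeps sequence 2 separate at a cutoff of 0.035 because its largest distance to the merged pair (0.04) exceeds the cutoff, while nearest and average neighbor both absorb it. In general, furthest neighbor gives the most OTUs at a given cutoff and nearest neighbor the fewest.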

wuyun · July 10, 2011, 1:49pm

Hi Pat, I’ve run two clustering analyses using different tools, mothur and the RDP tools,
and even when I chose the same method, “furthest”, I got very different results. For example:
from mothur, I got:
cutoff, number of clusters
0.01 25455
0.02 15008
0.03 11038
…

from RDP, I got:
0.01 10955
0.02 6550
0.03 4502
…

I wonder where this difference comes from. My guess is that mothur does something with the abundance information (via the name option) that RDP doesn’t, since RDP does not have this option, right?

pschloss · July 11, 2011, 7:56pm

Yeah, I can’t speak to the RDP pipeline: I don’t use it because I’m not a fan of their data curation or alignment methods. The number after the cutoff is the size of the dominant OTU at that cutoff, so if you aren’t feeding the abundance of each sequence back in, that number will be different.

Pat

wuyun · July 12, 2011, 9:33pm

The numbers I listed should be the actual OTU counts: I got them from the 2nd column of the .rabund file, not from the 2nd column of the .sabund file, which is what you described above.

Anyway, that won’t affect the conclusion that mothur uses more information to do a better job.
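
To make the two "2nd columns" concrete, here is a small Python sketch. It assumes the layout described in this thread: a .rabund line holds the label, the number of OTUs, then one abundance per OTU, while the matching .sabund line holds the label, the size of the largest OTU, then the number of OTUs of size 1, 2, and so on. The helper names and the sample line are made up for illustration; check your own files before relying on this.

```python
def parse_rabund_line(line):
    """Split one .rabund line into (label, number of OTUs, abundances)."""
    fields = line.split()
    label, n_otus = fields[0], int(fields[1])
    abundances = [int(x) for x in fields[2:]]
    return label, n_otus, abundances


def to_sabund_fields(abundances):
    """Return (largest OTU size, count of OTUs at each size 1..largest)."""
    largest = max(abundances)
    counts = [0] * largest
    for size in abundances:
        counts[size - 1] += 1
    return largest, counts


# Hypothetical line: at cutoff 0.03 there are 4 OTUs of sizes 5, 2, 1, 1.
label, n_otus, abundances = parse_rabund_line("0.03\t4\t5\t2\t1\t1")
largest, counts = to_sabund_fields(abundances)

print(label, "OTUs:", n_otus)                 # 2nd column of .rabund
print(label, "largest OTU:", largest)         # 2nd column of .sabund
print(label, "OTUs per size:", counts)
```

Under this layout, the OTU count at a cutoff is the 2nd column of .rabund (equivalently, the sum of the per-size counts in .sabund), whereas the 2nd column of .sabund is only the size of the dominant OTU.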
