"cluster" vs "classify.seq"

Dakhoo · May 21, 2014, 12:23am

I am having a bit of difficulty understanding the difference between “cluster” and “classify.seq”.

“classify.seq” lets users assign their sequences to the taxonomy outline of their choice. So I assume that mothur basically calculates the distance from each sequence to the taxonomy groups in the “.tax” file and decides which sequences belong to which group.

On the other hand, there is “cluster”, which works on the distance matrix: it finds sequences that are close together according to that matrix and groups them into clusters.

So in principle, either we have a set of OTUs and would like to see which sequences belong to which OTU, or we don’t have OTUs and would like to figure out potential OTUs.

Here are my questions:

Is my understanding correct?
What measure is used to calculate these distances?
Is this distance matrix symmetric? Is it similar to a correlation matrix?

Thank you

pschloss · May 21, 2014, 2:30pm

Is my understanding correct?

The taxonomic assignment uses the naive Bayesian classifier from Wang et al. that is used by the RDP. You should consult that paper for a description of the approach. Cluster assigns sequences to bins based on their similarity to the other sequences in the database.
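To make the idea of a k-mer-based naive Bayesian classifier more concrete, here is a minimal Python sketch. It is not mothur's or the RDP's implementation: the word size (`k=3`), the add-half smoothing, the function names (`kmers`, `train`, `classify`) and the toy reference sequences are all assumptions chosen for illustration, and the confidence estimate used in practice is left out. See the Wang et al. paper for the real method.

```python
import math

def kmers(seq, k=3):
    """Return the set of overlapping k-mers (words) in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def train(references, k=3):
    """Group reference sequences by taxon and store each one's k-mer set."""
    model = {}
    for seq, taxon in references:
        model.setdefault(taxon, []).append(kmers(seq, k))
    return model

def classify(query, model, k=3):
    """Return the taxon with the highest summed log word probability."""
    words = kmers(query, k)
    best_taxon, best_score = None, -math.inf
    for taxon, kmer_sets in model.items():
        total = len(kmer_sets)
        score = 0.0
        for word in words:
            # Number of reference sequences in this taxon containing the word.
            hits = sum(1 for s in kmer_sets if word in s)
            # Add-half smoothing (an illustrative choice) avoids log(0).
            score += math.log((hits + 0.5) / (total + 1))
        if score > best_score:
            best_taxon, best_score = taxon, score
    return best_taxon

# Hypothetical reference set: (sequence, taxon label) pairs.
references = [
    ("ACGTACGTACGT", "TaxonA"),
    ("ACGTACGAACGT", "TaxonA"),
    ("TTGGCCAATTGG", "TaxonB"),
    ("TTGGCCTATTGG", "TaxonB"),
]
model = train(references)
print(classify("ACGTACGTAC", model))
```

Note that no distance to a taxonomy group is computed here: the query is scored by how likely its words are under each taxon's reference sequences.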

What measure is used to calculate these distances?

Is this distance matrix symmetric? Is it similar to a correlation matrix?

Yes, the distances are symmetric. You can learn more about the distances at the dist.seqs wiki page.
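As a rough illustration of both points, the Python sketch below computes a pairwise distance matrix, checks that it is symmetric, and then bins sequences with a simple furthest-neighbor rule. This is only a sketch under stated assumptions: the distance is the plain fraction of mismatched positions between pre-aligned, equal-length sequences, the helper names (`distance`, `distance_matrix`, `cluster`), the toy sequences and the 0.2 cutoff are made up, and it does not try to reproduce the calculators or gap handling described on the dist.seqs wiki page.

```python
from itertools import combinations

def distance(a, b):
    """Fraction of mismatched positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)

def distance_matrix(seqs):
    """Full pairwise distance table keyed by (name, name)."""
    return {(p, q): distance(seqs[p], seqs[q]) for p in seqs for q in seqs}

def cluster(seqs, cutoff):
    """Furthest-neighbor agglomerative clustering up to a distance cutoff."""
    dist = distance_matrix(seqs)
    clusters = [[name] for name in seqs]

    def linkage(c1, c2):
        # Furthest neighbor: the largest distance between any two members.
        return max(dist[(p, q)] for p in c1 for q in c2)

    while len(clusters) > 1:
        # Find the pair of clusters that is closest under this linkage.
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]))
        if linkage(clusters[i], clusters[j]) > cutoff:
            break  # Nothing left that is similar enough to merge.
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters

# Hypothetical aligned sequences.
seqs = {
    "s1": "ACGTACGTAC",
    "s2": "ACGTACGTAA",
    "s3": "TTGGCCAATT",
    "s4": "TTGGCCAATA",
}
dist = distance_matrix(seqs)
is_symmetric = all(dist[(p, q)] == dist[(q, p)] for p in seqs for q in seqs)
print(is_symmetric)
print(cluster(seqs, cutoff=0.2))
```

Unlike a correlation matrix, where identical items score 1, a distance matrix has 0 on the diagonal and larger values for less similar sequences.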

Dakhoo · May 21, 2014, 4:19pm

Thank you.

I couldn’t find a direct link to the Wang method. I googled “wang naive bayesian sequence” and found http://www.ncbi.nlm.nih.gov/pubmed/17586664. Is that the correct reference?

pschloss · May 21, 2014, 7:40pm

That’s the one!
