Tree.shared vs. Clearcut.

dwaite · December 11, 2011, 8:58pm

Hi,

I’m working with some sequence from several different samples that I want to compare using unifrac. I had done the analysis previously building my tree file using the tree.shared() command, but I recently went back and tried building the tree file using clearcut() and found that in the subsequent unifrac it gave a different result. It doesn’t bother me what the true result is (whether the communities are significantly different or not) but it worries me that I can get different results from the same data depending on which way I build the tree. Reading through the mothur wiki I’ve seen both methods used for performing unifrac so I don’t see a clear preference for performing the analysis. Can anyone enlighten me as to which results to trust?

Also, on a general note, is there a good rule of thumb for ideal sequence length when build the distance matrix for these sort of analysis? I ask because there seems to be a bit of a judgement call when screening the aligned sequences. I suppose it’s the choice between more sequences in the data set or fewer, longer, sequence to analyze.

pschloss · December 13, 2011, 12:14am

Within mothur, I would encourage one to use clearcut to build trees with sequences and tree.shared to build trees of groups. clearcut uses the neighbor joining algorithm whereas tree.shared uses average neighbor. They’re pretty different algorithms.

Topic		Replies	Views
Theory behind unifrac in mothur Theory behind mothur	3	3734	January 21, 2015
Tree for Unifrac.Weighted? Theory behind mothur	5	1517	December 5, 2017
cluster.split and unifraq Theory behind mothur	1	3274	January 14, 2013
Tree.shared Theory behind mothur	1	11310	February 11, 2010
Ordination using unifrac distances? Theory behind mothur	5	5785	August 18, 2014

Tree.shared vs. Clearcut.

Related topics