Phylo.diversity & UniFrac

jarrod_s · June 5, 2010, 8:24pm

Hi all,

Both the <phylo.diversity> and unifrac commands require the user to read in a tree, so I generated an NJ tree from a PHYLIP-formatted distance matrix generated in mothur.

Immediately prior to generating the distance file I performed a final <unique.seqs> command with the names option. From this I have a unique.fasta file and a unique.names file, both with the same number of rows (i.e. clones). But at this point I also have a *.groups file generated BEFORE the last unique.seqs step. So, the groups file has more rows (i.e. clones) than either the names or fasta file.

This is not an issue in other mothur commands but is an issue for the <phylo.diversity> and unifrac commands. Here mothur automatically generates a new group file with the same number of clones that appear in the distance matrix, effectively eliminating identical clones from different groups.

FINALLY to my question. If the final unique.seqs step collapses identical clones from different groups into a single clone, won’t this affect the results of phylo.diversity and unifrac? It seems that I should generate a separate distance matrix, and hence an NJ tree, from a non-deconvoluted dataset so that information is not lost. Am I making sense?

westcott · June 7, 2010, 11:00am

The read.tree command has a name parameter, so you can include the names file generated by the unique.seqs command.
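To illustrate why the names file keeps the group information from being lost, here is a minimal Python sketch. It is not mothur's own code, just an illustration under the assumption that the names file has two tab-separated columns (the representative sequence, then a comma-separated list of every sequence it stands for) and the groups file has one tab-separated sequence/group pair per line. The clone and group names are made up for the example.

```python
from collections import defaultdict


def parse_names(text):
    """Map each representative sequence to all sequences it stands for."""
    reps = {}
    for line in text.strip().splitlines():
        rep, members = line.split("\t")
        reps[rep] = members.split(",")
    return reps


def parse_groups(text):
    """Map each sequence name to its group."""
    groups = {}
    for line in text.strip().splitlines():
        seq, group = line.split("\t")
        groups[seq] = group
    return groups


def group_counts(reps, groups):
    """For each tree tip (representative), count its members per group."""
    counts = {}
    for rep, members in reps.items():
        per_group = defaultdict(int)
        for member in members:
            per_group[groups[member]] += 1
        counts[rep] = dict(per_group)
    return counts


# Hypothetical data: cloneA1 is the representative of three identical
# clones, one from soilA and two from soilB.
names_text = "cloneA1\tcloneA1,cloneB7,cloneB9\ncloneA2\tcloneA2\n"
groups_text = (
    "cloneA1\tsoilA\n"
    "cloneA2\tsoilA\n"
    "cloneB7\tsoilB\n"
    "cloneB9\tsoilB\n"
)

counts = group_counts(parse_names(names_text), parse_groups(groups_text))
print(counts)
```

So even though the tree only has one tip for the three identical clones, combining the names file with the full groups file recovers which groups that tip belongs to, and you should not need a second tree built from the non-unique sequences.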

jarrod_s · June 7, 2010, 2:35pm

Thanks a lot. I did not realize that read.tree had a names option.
