Hello, I have some questions regarding the construction of a tree. I have done my analysis using the OTU based approach and most my files were column formatted as in the 454 SOP. I don’t believe that a tree would add anything to my findings but a tree would provide some nice visuals (everybody loves decorated trees, don’t they?).
I started with
dist.seqs(fasta=final.pick.fasta, output=phylip, processors=16) #Output File Names: #final.pick.phylip.dist #It took 911 to calculate the distances for 47601 sequences.
clearcut(phylip=final.pick.phylip.dist) #Output File Names:
final.pick.phylip.tre
however, when visualize the tree, the tips are labelled with the names of the 47,601 unique sequence which is too much (and not OTU). Also, i attempted to colour the tree using the sequence abundance but with no luck since there is no such a file that contain unique sequence abundance by group (i.e. similar to the shared file for OTUs).
what i need help with is (if it makes sense) to
Truncate my “final.pick.fasta” to include only the ones that corresponds to my OTUs
( this is how i got my OTUs
dist.seqs(fasta=final.fasta, cutoff=0.15, processors=16) #Output File Names: #final.dist
cluster(column=final.dist, name=final.names) #changed cutoff to 0.0543412 #Output File Names: #final.an.sabund #final.an.rabund #final.an.list)
run again the dis.seqs and clearcut using the shortened fasta to generate a tree with a number of tips that is equivalent to the number of OTUs.
will this allow me to use my taxonomy file (from classify.otu) to decorate the tree?
will this also allow me to use my shared file decorate the tree i.e. colour different groups, abundance,…
it sound a long way around. But i think it is doable. any help would be great
great thanks, i somehow , when first looked at the get.oturep did not realize that there was a fasta option and i ended up running the get.oturep and getting a .names file only. (THAT I REALLY DIDN’T KNOW WHAT TO DO WITH)
It runs great now.
THHAAAAANNNNKKKKKSSSSS
I played around with sed for awhile and managed to get the headers to just the OTU IDs and that ended up working fine, but based on the previous messages in this post, it seems like this shouldn’t be necessary. Is there something I missed?