I have close to 130000 Sequences and when I use mothur to classify and obtain taxonomic information. I got full taxonomy for them all which seems accurate. But Each and all of Sequences adresses to indiviuals OTUs (I have close to 130000 OTUs), how can I solve this? where did I do wrong and here is my commands;
unique.seqs(fasta=.fasta)
align.seqs(candidate=.unique.fasta,template=a template file,flip=t,processors=12) filter.seqs(fasta=.unique.align) unique.seqs(fasta=.unique.filter.fasta,name=.names) dist.seqs(fasta=.unique.filter.unique.fasta,calc=onegap,countends=F,cutoff=0.03,output=lt,processors=12) cluster(phylip=.unique.filter.unique.phylip.dist,name=.names,method=average,cutoff=0.03)
classify.seqs(fasta=.fasta,template=a non gap template file,taxonomy=**.tax,cutoff=60) classify.otu(taxonomy=.taxonomy,list=*.unique.filter.unique.phylip.fn.list,label=0.03)
Can you try running it again with a higher cutoff? It’s explained in the FAQs here, but basically the cutoff during distance calculation isn’t the same as the distance you later cluster at. Usually the maximum distance in your matrix will be lower than the cutoff you use during the distance matrix.