Each and All of Sequences adressed to indiviuals OTU, how can I solve this?

mhmtgenc · May 25, 2016, 6:23am

I have close to 130000 Sequences and when I use mothur to classify and obtain taxonomic information. I got full taxonomy for them all which seems accurate. But Each and all of Sequences adresses to indiviuals OTUs (I have close to 130000 OTUs), how can I solve this? where did I do wrong and here is my commands;

unique.seqs(fasta=.fasta)
align.seqs(candidate=.unique.fasta,template=a template file,flip=t,processors=12) filter.seqs(fasta=.unique.align) unique.seqs(fasta=.unique.filter.fasta,name=.names) dist.seqs(fasta=.unique.filter.unique.fasta,calc=onegap,countends=F,cutoff=0.03,output=lt,processors=12) cluster(phylip=.unique.filter.unique.phylip.dist,name=.names,method=average,cutoff=0.03)
classify.seqs(fasta=.fasta,template=a non gap template file,taxonomy=**.tax,cutoff=60) classify.otu(taxonomy=.taxonomy,list=*.unique.filter.unique.phylip.fn.list,label=0.03)

Kendra · May 25, 2016, 4:09pm

What are you sequencing? Can you do summary.seqs on your aligned fasta and copy the results here?

mhmtgenc · May 26, 2016, 6:21am

Start End NBases Ambigs Polymer NumSeqs
Minimum: -1 -1 0 0 1 1
2.5%-tile: 1855 2578 272 0 4 6139
25%-tile: 1903 2583 272 0 4 61383
Median: 1903 2632 274 0 5 122766
75%-tile: 1903 3726 274 0 5 184149
97.5%-tile: 1904 3732 274 0 6 239393
Maximum: 6852 6852 274 0 57 245531
Mean: 1908.5 3122.04 272.162 0 4.68964

of Seqs: 245531

mhmtgenc · May 27, 2016, 7:22am

16s rrna by the way

mhmtgenc · June 6, 2016, 10:56am

I need your help guys

pschloss · June 7, 2016, 1:26pm

But Each and all of Sequences adresses to indiviuals OTUs (I have close to 130000 OTUs), how can I solve this?

Sorry, but I don’t understand what you’re asking.

dwaite · June 7, 2016, 10:16pm

To start with, with your line

dist.seqs(fasta=*.unique.filter.unique.fasta,calc=onegap,countends=F,cutoff=0.03,output=lt,processors=12)

Can you try running it again with a higher cutoff? It’s explained in the FAQs here, but basically the cutoff during distance calculation isn’t the same as the distance you later cluster at. Usually the maximum distance in your matrix will be lower than the cutoff you use during the distance matrix.

mhmtgenc · June 28, 2016, 11:27am

I have ~130000 sequences and I have ~130000 OTUs. Each of my sequences annotated as individual OTUs. I think now it is more clear

pschloss · June 30, 2016, 6:08pm

At what OTU label? Have you tried the higher cutoff in dist.seqs as suggested by dwaite?

Topic		Replies	Views
I get too many OTUs. Like almost same as sequence number. Commands in mothur	3	1057	April 24, 2017
Problem with OTU classification mothur bugs	5	5590	April 19, 2010
taxonomy problems Commands in mothur	3	1343	May 31, 2016
classify.seqs() Commands in mothur	5	11916	March 10, 2010
Diversity analysis: How to build and compare OTUs from different samples with same sequence ID Commands in mothur	4	2131	June 21, 2016

Each and All of Sequences adressed to indiviuals OTU, how can I solve this?

of Seqs: 245531

Related topics