Too many unclassified

umjdf · October 12, 2011, 4:03pm

Hi

I’m trying to analyze sequences from cecum, ileum content, ileum tissue etc., but everything I’ve tried has left me with a large percentage of my sequences unclassified at the phylum (and genus) level. I’ve tried using silva.bacteria.rdp6.tax, silva.bacteria.rdp.tax, silva.bacteria.silva.tax as my templates in classify.seqs with a cutoff of 80, 60 and 40. The template silva.bacteria.rdp6.tax has given me the best results thus far at a cutoff of 40, but using that cutoff is really low, so I’d like to use a higher one. I’m out of options as to what to try to do.

Is it something in my classify.seqs that would cause this? Or at another prior command? Are there any other templates you’d suggest using?

I’m aware we expect a large amount of unclassified in these environmental samples, but the %unclassified I’m getting is much higher than other datasets previously analyzed in my lab from the same sample types.

Thanks!

pschloss · October 14, 2011, 8:44pm

Hmmm… You might try the RDP training set:

http://www.mothur.org/w/images/4/49/RDPTrainingSet.zip

umjdf · October 24, 2011, 7:20pm

Ok… and if this didn’t work? Try…?

pschloss · October 25, 2011, 11:28am

Are you sure the sequences are in the “right” direction? We don’t automatically flip the sequences at this point…

shuixia100 · December 4, 2011, 9:55am

What is the average length of your sequences, it the length is less than 100bp, I think is reasonable to get very high unclassified. There is also another way that you could try to use GAST for classification .

Topic		Replies	Views
classify seqs V1.19 mothur bugs	8	9386	July 11, 2011
classification leads to many unclassified Commands in mothur	5	6887	July 28, 2010
Classifying Sequences Commands in mothur	5	4341	March 21, 2012
classify.seqs Commands in mothur	1	910	March 13, 2017
Bacteria_unclassified Theory behind mothur	10	2513	November 30, 2021

Too many unclassified

Related topics