classification

sje062 · May 22, 2017, 9:14am

Hi mothur forum,

Is there a way to further group the sequences in an OTU
that is defined by taxonomy as Bacteria unclassified? I have
the rep seq, but I would like to know more about this big group
of sequences that Silva v128 barely recognises.

Sigmund

Kendra · May 22, 2017, 2:44pm

Try blasting the rep seq against RefSeq.

sje062 · May 23, 2017, 1:56pm

Thanks, I did that (closest hit 89%) and could do it for
all of them, but it would be easier if the ca. 6000 sequences
could somehow be split into smaller groups, so that I could
blast representatives from each subcluster. The sequences
are from different regions of the 16S, so I don’t think
they will align well. Clustering by phylotype returned
that big OTU classified as Bacteria. I suspect this OTU
includes a lot of different sequences. Further comments on
how to cluster unaligned, unclassified 16S sequences
would be helpful.

Sigmund

Kendra · May 23, 2017, 5:23pm

What are your samples? For things like soil, there likely won’t be many closer relatives in RefSeq. You just have to embrace the unknown.

If you really want to try, you can cluster the unknown into a higher level OTU (say 5% rather than 3%) and blast those reps.
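
As a rough illustration of what moving from a 3% to a 5% cutoff does, here is a minimal Python sketch. It is not how mothur clusters. It assumes the sequences are already aligned to equal length (which may not hold for reads from different 16S regions), and the function names and toy sequences are made up for illustration. It uses a simple greedy scheme: each sequence joins the first representative within the cutoff, otherwise it starts a new group.

```python
def distance(a, b):
    """Fraction of mismatching positions between two equal-length aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = mismatches = 0
    for x, y in zip(a, b):
        if x in "-." and y in "-.":
            continue  # skip columns that are gaps in both sequences
        compared += 1
        if x != y:
            mismatches += 1
    return mismatches / compared if compared else 1.0


def greedy_cluster(seqs, cutoff):
    """Group sequences: each joins the first representative within `cutoff`."""
    clusters = {}  # representative name -> list of member names
    for name, seq in seqs.items():
        for rep in clusters:
            if distance(seq, seqs[rep]) <= cutoff:
                clusters[rep].append(name)
                break
        else:
            clusters[name] = [name]  # no representative close enough: new group
    return clusters


def mutate(seq, n):
    """Toy helper (hypothetical): change the first n bases to a different base."""
    swap = {"A": "C", "C": "G", "G": "T", "T": "A"}
    return "".join(swap[c] for c in seq[:n]) + seq[n:]


# Toy 100-column "alignment": s2, s3, s4 differ from s1 by 2%, 4% and 10%.
base = "ACGT" * 25
seqs = {"s1": base, "s2": mutate(base, 2), "s3": mutate(base, 4), "s4": mutate(base, 10)}

clusters_3 = greedy_cluster(seqs, 0.03)
clusters_5 = greedy_cluster(seqs, 0.05)
print(len(clusters_3), "groups at 3%;", len(clusters_5), "groups at 5%")
```

With the looser cutoff there are fewer groups, so fewer representatives (the dictionary keys) to blast.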

sje062 · May 23, 2017, 7:32pm

Thanks again for the comment.
The samples are coral. That OTU is just the Bacteria unknowns
at label 1. I could try, for example, label 3 with the phylotype command
and see what happens.

Kendra · May 23, 2017, 9:34pm

Label 3 will be unknown.

sje062 · May 26, 2017, 7:46am

I am searching for a method that groups 16S sequences
without using an alignment or the taxonomy.

Kendra · May 26, 2017, 1:37pm

Why don’t you want to align them?

sje062 · May 26, 2017, 4:26pm

They do not all overlap in the same region because they come from
different primer sets. I would like to compare sequences from different
studies, downloaded from NCBI.

Kendra · May 26, 2017, 6:23pm

Ah, then you are stuck with taxonomy. You could use the approach that QIIME uses, where they cluster a database, then match seqs to it and report the database sequence. But if your sequences aren’t in a database, that approach is out.
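
The database-matching idea can be sketched roughly as follows. This is only an illustration in Python, not QIIME's implementation: it swaps in k-mer Jaccard similarity for a real sequence search, and the function names, the k-mer size, the 0.5 threshold and the toy sequences (which are not real 16S) are all assumptions. Each query is reported as its best-matching reference, or as unmatched when nothing passes the threshold.

```python
def kmers(seq, k=8):
    """Set of all overlapping k-mers in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Shared k-mers divided by total distinct k-mers."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def closed_reference(queries, references, k=8, min_similarity=0.5):
    """Map each query name to its best reference name, or None if below threshold."""
    ref_kmers = {name: kmers(seq, k) for name, seq in references.items()}
    assignments = {}
    for qname, qseq in queries.items():
        q = kmers(qseq, k)
        best_name, best_score = None, 0.0
        for rname, r in ref_kmers.items():
            score = jaccard(q, r)
            if score > best_score:
                best_name, best_score = rname, score
        # Report the database sequence only if the match is good enough.
        assignments[qname] = best_name if best_score >= min_similarity else None
    return assignments


# Toy reference "database" (made-up sequences, chosen to share no k-mers).
references = {
    "refA": "ACCACAACCCAACACCACAAACCCACACAACCACCCAACA",
    "refB": "GTTGTGGTTTGGTGTTGTGGGTTTGTGTGGTTGTTTGGTG",
}
queries = {
    "q_same": references["refA"],                # identical to refA
    "q_near": references["refB"][:-1] + "A",     # refB with one base changed
    "q_unknown": "ATATTAATTTAATATTAAATTTATATAATTATTTAATATA",  # not in the database
}

result = closed_reference(queries, references)
for name, hit in result.items():
    print(name, "->", hit)
```

Queries that come back as `None` are the ones with no close relative in the database, which is the case where this approach is out.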
