mothur

How can I classify OTUs to "SPECIES" level with mothur?

I have a large number of sequences that I would like to classify to the “species” level with mothur. I was able to classify my OTUs to the genus level but couldn’t get to the species level. Can someone explain why? (For example, which database file should I choose for 16S rRNA sequences?)

Please help me I am getting pretty lost :slight_smile:

Can you provide more information?

What commands did you run and with what parameters? When you say you couldn’t classify to the species level, what does the output for an OTU in your taxonomy file look like? And perhaps most importantly, which database did you classify with? (The SILVA database does not have species-level information.)

Remember that the classification depends on the database you are classifying against. First, many organisms in the databases aren’t classified to species because we don’t know what species they are. Second, V4 may not contain enough information to classify to species even if closely related species are in the database. Check out V4 for E. coli and Shigella for an example of this.

Thanks for the reply. Here is my output:
“OTU Size Taxonomy
Otu00001 25137 Bacteria(100) Proteobacteria(100) Betaproteobacteria(100) Burkholderiales(100) Alcaligenaceae(100) Bordetella(99)
Otu00002 15182 Bacteria(100) Proteobacteria(100) Alphaproteobacteria(100) Rhodobacterales(100) Rhodobacteraceae(100) Paracoccus(100)
Otu00003 2758 Bacteria(100) Proteobacteria(100) Betaproteobacteria(100) Burkholderiales(100) Alcaligenaceae(100) Bordetella(99)
Otu00004 2493 Bacteria(100) Proteobacteria(100) Alphaproteobacteria(100) Rhodobacterales(100) Rhodobacteraceae(100) Paracoccus(100)
Otu00005 1516 Bacteria(100) Proteobacteria(100) Betaproteobacteria(100) Burkholderiales(100) Alcaligenaceae(100) Bordetella(77)
Otu00006 1436 Bacteria(100) Proteobacteria(100) Alphaproteobacteria(61) Rhodobacterales(52) Rhodobacteraceae(52) unclassified(100)”
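
For readers who want to see at a glance how deep each OTU in output like this actually goes, here is a minimal Python sketch. It is not part of mothur. It assumes each line holds an OTU label, a size, and a list of `Name(number)` entries separated by whitespace or semicolons, and it treats the number in parentheses as a support value. The function name `deepest_rank` and the `min_support` threshold of 80 are illustrative choices, not mothur settings.

```python
import re

# Rank labels are assigned by position; this assumes a six- or seven-level lineage.
RANKS = ["domain", "phylum", "class", "order", "family", "genus", "species"]

# Matches entries such as "Bordetella(99)", separated by spaces or semicolons.
TAXON = re.compile(r"([^\s;()]+)\((\d+)\)")


def deepest_rank(line, min_support=80):
    """Return (otu, size, (rank, name)) for the deepest well-supported taxon.

    The walk stops at the first level that is 'unclassified' or whose
    support value falls below min_support (a hypothetical threshold).
    """
    otu, size, taxonomy = line.split(None, 2)
    best = None
    for rank, (name, support) in zip(RANKS, TAXON.findall(taxonomy)):
        if name.lower().startswith("unclassified") or int(support) < min_support:
            break
        best = (rank, name)
    return otu, int(size), best


# Three lines copied from the output quoted above.
example_lines = [
    "Otu00001 25137 Bacteria(100) Proteobacteria(100) Betaproteobacteria(100) "
    "Burkholderiales(100) Alcaligenaceae(100) Bordetella(99)",
    "Otu00005 1516 Bacteria(100) Proteobacteria(100) Betaproteobacteria(100) "
    "Burkholderiales(100) Alcaligenaceae(100) Bordetella(77)",
    "Otu00006 1436 Bacteria(100) Proteobacteria(100) Alphaproteobacteria(61) "
    "Rhodobacterales(52) Rhodobacteraceae(52) unclassified(100)",
]

for line in example_lines:
    print(deepest_rank(line))
```

None of these lineages has a seventh entry, so no threshold choice could produce a species here: the reference used for classification simply did not supply one.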

And I can’t be sure which database I should use.

And thank you too, my friend. Again, my problem is not knowing how to choose a database or how to modify it. Please help with this.

Some (including me) would argue that you’re asking too much to try to get to the species level using 250 nt from a 16S rRNA gene. This is why the databases generally do not include species-level names. The greengenes database does provide some species, but only for about 10% of the genera, so that really isn’t very helpful. If you have a group that you really want to take to the species level, you would have to add multiple sequences per genus and multiple sequences per species and provide species-level taxonomy information for those sequences.

Pat
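
To make the last suggestion in the post above a bit more concrete, here is a rough Python sketch of preparing species-level taxonomy lines for extra reference sequences. It assumes the reference taxonomy file is tab-separated, with a sequence name followed by a semicolon-delimited lineage that ends in a semicolon and contains no spaces. The helper names, the sequence IDs (`refA`, `refB`, `refC`), and the species assignments are all hypothetical placeholders, and the matching aligned sequences would still have to be added to the reference alignment separately.

```python
from collections import Counter


def add_species(taxonomy, species):
    """Append a species label to a semicolon-delimited lineage string."""
    levels = [t for t in taxonomy.strip().split(";") if t]
    levels.append(species.replace(" ", "_"))  # no spaces inside taxon names
    return ";".join(levels) + ";"


def taxonomy_lines(records):
    """Build 'name<TAB>lineage;' lines from (seq_id, lineage, species) tuples."""
    return [f"{seq_id}\t{add_species(tax, sp)}" for seq_id, tax, sp in records]


def under_sampled(records, min_per_species=2):
    """List species represented by fewer than min_per_species sequences."""
    counts = Counter(sp for _, _, sp in records)
    return sorted(sp for sp, n in counts.items() if n < min_per_species)


# Hypothetical records: the IDs and species labels are placeholders, not real data.
GENUS_LINEAGE = (
    "Bacteria;Proteobacteria;Betaproteobacteria;"
    "Burkholderiales;Alcaligenaceae;Bordetella;"
)
records = [
    ("refA", GENUS_LINEAGE, "Bordetella pertussis"),
    ("refB", GENUS_LINEAGE, "Bordetella pertussis"),
    ("refC", GENUS_LINEAGE, "Bordetella bronchiseptica"),
]

for line in taxonomy_lines(records):
    print(line)
print("Need more sequences for:", under_sampled(records))
```

The `under_sampled` check reflects the point about needing multiple sequences per species. The threshold of two is only an illustration.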

Here they claim it’s possible for 14 species against SILVA v123: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0176555

That paper is wrong. They got way out over their skis on the claims they made in there.

:skier::skier::skier:

so pleased there’s an emoji for this
