Trainset Necessary?

AndrewTing · September 2, 2020, 3:08am

I am doing V3-V4 16S amplicon sequencing for my project, but I am quite new to bioinformatics.

For classify.seqs, is it necessary to use a trainset for the reference and taxonomy files? Can I use silva.nr_v138.align as the reference and silva.nr_v138.tax as the taxonomy?

Is it also necessary to have metadata for make.biom? If so, how do I get it?

Thank you

pschloss · September 3, 2020, 2:59pm

That’s fine. “trainset” is just the name of the RDP training set, and “silva.nr_v138” is the name of the SILVA training set.

Pat

AndrewTing · September 4, 2020, 3:34am

Thanks!

After classifying the sequences, some of them come out as unclassified at the genus level.

For example:
One sequence is classified as enterobacteriaceae;enterobacter
Another sequence is classified as enterobacteriaceae;enterobacteriaceae_unclassified

pschloss · September 4, 2020, 12:15pm

That means the classifier didn’t have enough confidence to classify the second sequence to the genus level, so it stopped at the family.
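
To make the idea concrete, here is a minimal Python sketch of how a confidence cutoff can turn a genus call into a "_unclassified" label. It is an illustration only, not mothur's actual code: the function name `apply_cutoff`, the support values, and the cutoff of 80 are all assumptions made up for this example.

```python
def apply_cutoff(ranks, cutoff=80):
    """Label each rank, replacing low-support ranks with '<parent>_unclassified'.

    ranks  : list of (taxon_name, support) pairs, ordered from higher to lower rank.
    cutoff : hypothetical minimum support (0-100) needed to accept a rank.
    """
    labels = []
    parent = "unknown"   # last taxon that met the cutoff
    confident = True     # becomes False at the first rank below the cutoff
    for name, support in ranks:
        confident = confident and support >= cutoff
        if confident:
            parent = name
            labels.append(name)
        else:
            # Every rank from here down inherits the last confident name.
            labels.append(f"{parent}_unclassified")
    return ";".join(labels)


# Invented support values, for illustration only.
seq_a = [("Enterobacteriaceae", 100), ("Enterobacter", 95)]
seq_b = [("Enterobacteriaceae", 100), ("Enterobacter", 62)]

result_a = apply_cutoff(seq_a)
result_b = apply_cutoff(seq_b)
print(result_a)
print(result_b)
```

In this sketch both sequences have the same best-matching genus, but only the first has enough support to keep it. Lowering the cutoff would keep more genus-level names at the cost of less reliable assignments.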

Pat

