sub.sample for use with the classify.otu command?

AOldham · September 6, 2011, 8:17pm

Hello all
I am trying to normalize the number of sequences so that I am analyzing the same number for each sample. That being said…I ran the sub.sample command using the shared and groups files with the size parameter, which works great with the summary single command (I can compare the various calculators for equivalent numbers of sequences for all of my samples). This was the command I used to normalize the number of sequences across the sample groups: sub.sample(shared=3C.final.an.shared, groups=3CAuto-3CMax-3CPower, size=10512).

The problem I am having concerns otu classification. Next, I would like to normalize the number of sequences for each sample for use with the classify.otu command (so that I can compare the relative abundance of various taxa for equivalent numbers of sequences).

This is the command I used for classification before the number of sequences for each group were normalized: classify.otu(list=3C.final.an.list, name=3C.final.names, group=3C.final.groups, taxonomy=3C.final.taxonomy, basis=sequence, cutoff=80, label=0.03).

How would I modify this command (or what other files do I need to generate) so that I get a classification for equivalent numbers of sequences for each group, rather than a classification for the original number of sequences? I assumed it would be necessary to also normalize for classification since more sequences = more otus.

Thanks
AO

pschloss · October 4, 2011, 12:37pm

How would I modify this command (or what other files do I need to generate) so that I get a classification for equivalent numbers of sequences for each group, rather than a classification for the original number of sequences? I assumed it would be necessary to also normalize for classification since more sequences = more otus.

I don’t think this is actually necessary since the relative abundances shouldn’t really change much. But… you could run sub.sample on the list files with list, group, name file options…

sub.sample(fasta=esophagus.unique.fasta, name=esophagus.names, group=esophagus.groups)

Hope this helps…
Pat

Topic		Replies	Views
classify.otu with normalised data Commands in mothur	17	17600	September 10, 2014
help needed: subsample not carried through to classif.otu Commands in mothur	3	3316	June 19, 2012
sub.sample Commands in mothur	8	12756	April 12, 2012
classify.otu after subsampling Commands in mothur	1	1206	August 15, 2016
Normalization Commands in mothur	1	4332	May 8, 2012

sub.sample for use with the classify.otu command?

Related topics