sub.sample question

james · February 22, 2014, 2:37pm

I have a question regarding the subsample command.
Here’s the version I am using:
Windows version
Running 64Bit Version
mothur v.1.32.1
Last updated: 1/6/2014

Here is the command I enter. If I understand it correctly, I am asking the program to subsample 1000 times from each group. I have 12 groups. I am sampling from the dist file created from my sequences.

mothur > sub.sample(shared=final.phylip.an.shared, size=1000, label=0.03)
Unable to open C:\mothur\in\final.phylip.an.shared. Trying output directory C:\mothur\out-SSURef\final.phylip.an.shared
Sampling 1000 from each group.
0.03

Output File Names:
C:\mothur\out-SSURef\final.phylip.an.0.03.subsample.shared

Here is what the output file looks like (partially):

label Group numOtus Otu0001 . . .
0.03 07 1221 386 . . .
0.03 08 1221 308 . . .
0.03 09 1221 317 . . .
0.03 16 1221 280 . . .

My question is, since I am asking the program to subsample my sequences 1000 times, how am I getting a total of 1221 OTUs for each group? Intuitively, I would expect a maximum total of 1000 OTUs for each group (if each read were completley unique to a separate OTU), but, in reality, a much lower numOTUs for each gorup. Discounting the 0’s, I get:

label Group numOtus Otu0001 . . .
0.03 07 181 386 . . .
0.03 08 177 308 . . .
0.03 09 187 317 . . .
0.03 16 145 280 . . .

The 1221 numOTUs seems to be the sum total of OTUs retrieved across all of the groups with some OTUs absent/present in each group. Those OTU’s with a “zero” in the column are being counted. Is it necessary to count the actual number of OTUs retrieved for each group for subsequent analyses or to port this data into other programs for analysis?

Thanks

pschloss · February 25, 2014, 9:42pm

The sub.sample command only subsamples once - size=1000 means take 1000 sequences once and make a new shared file. For things like dist.shared you can say iters=100 and it will generate 100 distance matrices based on taking 1000 sequences and then give you the average.

Pat

Topic		Replies	Views
Repeat sub.sample command Commands in mothur	1	1085	April 28, 2016
sub.sample feature Commands in mothur	2	3908	January 10, 2011
What's the rationale for sub.sample() after making OTUs Theory behind mothur	13	17559	May 4, 2013
help needed: subsample not carried through to classif.otu Commands in mothur	3	3316	June 19, 2012
How can we sub.sample with higher iterations Commands in mothur	4	1190	July 3, 2017

sub.sample question

Related topics