How does the sub.sample function work?

EllistonV · May 1, 2025, 9:39pm

Hello,

I just want to know how the sub.sample() function works. How does the function determine how many sequences to remove from each OTU/ASV? Does it remove more sequences from larger ASVs? Is a ratio kept between ASVs? Is the relative abundance of each ASV affected? Are low-abundance ASVs eliminated (I see that some singletons and doubletons are eliminated)? Thanks for your help!

Regards,
Elliston

pschloss · May 5, 2025, 1:25pm

Hi - it randomly draws the specified number of sequences from each sample. It does this empirically, so more abundant OTUs will be sampled more often than rarer ones. If a sample has many more sequences than the desired threshold, more of its rare OTUs will be lost. If a sample has fewer sequences than the desired threshold, that sample will be removed. If you don’t give it a threshold, it will use the size of the smallest sample. This function only does one sampling of each sample. dist.shared and summary.shared give the option of doing many subsamplings and then reporting the average of the alpha or beta diversity metric over those subsamplings.
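
To make the behavior described above concrete, here is a minimal Python sketch. It is not mothur's source code. The function names `sub_sample` and `sub_sample_all` and the dictionary layout are made up for illustration, and it assumes sequences are drawn without replacement.

```python
import random
from collections import Counter

def sub_sample(counts, size, rng):
    """Draw `size` sequences at random from one sample.

    `counts` maps an OTU label to its number of sequences.
    Returns None if the sample has fewer than `size` sequences.
    Assumption: sampling is without replacement.
    """
    if sum(counts.values()) < size:
        return None  # sample is below the threshold and gets dropped
    # One pool entry per sequence, so abundant OTUs are drawn more often.
    pool = [otu for otu, n in counts.items() for _ in range(n)]
    return dict(Counter(rng.sample(pool, size)))

def sub_sample_all(samples, size=None, seed=None):
    """Subsample every sample once; default size is the smallest sample."""
    if size is None:
        size = min(sum(c.values()) for c in samples.values())
    rng = random.Random(seed)
    result = {}
    for name, counts in samples.items():
        drawn = sub_sample(counts, size, rng)
        if drawn is not None:
            result[name] = drawn
    return result

# Hypothetical counts, not real data.
samples = {
    "A": {"Otu1": 90, "Otu2": 9, "Otu3": 1},
    "B": {"Otu1": 5, "Otu2": 5},
}
print(sub_sample_all(samples, size=50, seed=1))  # "B" is dropped
print(sub_sample_all(samples, seed=1))           # size defaults to 10
```

Because the draw is random, a singleton such as `Otu3` may or may not survive a given run, which is why rare OTUs can disappear after subsampling.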

Pat

EllistonV · May 6, 2025, 3:15pm

Oh, okay! This makes a lot of sense. Thank you very much!

EllistonV · May 6, 2025, 9:06pm

Also, the sub.sample function would not be the same as rarefying the data, correct? Or is it the same?

pschloss · May 8, 2025, 8:00pm

sub.sample outputs a shared file, whereas the rarefaction commands output other file formats. Subsampling is effectively rarefaction with a single randomization.
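
As a rough illustration of that relationship, the Python sketch below repeats a random draw many times and averages the number of OTUs observed. With `iters=1` it amounts to a single subsampling. The function names and the `iters` parameter are hypothetical, and the draw is assumed to be without replacement; this is not how mothur implements it.

```python
import random

def observed_otus(counts, size, rng):
    """Number of distinct OTUs seen in one random draw of `size` sequences."""
    # One pool entry per sequence (assumption: draw without replacement).
    pool = [otu for otu, n in counts.items() for _ in range(n)]
    return len(set(rng.sample(pool, size)))

def mean_observed_otus(counts, size, iters=1000, seed=None):
    """Average the observed OTU count over `iters` independent draws."""
    rng = random.Random(seed)
    total = sum(observed_otus(counts, size, rng) for _ in range(iters))
    return total / iters

# Hypothetical counts, not real data.
counts = {"Otu1": 90, "Otu2": 9, "Otu3": 1}
print(mean_observed_otus(counts, size=10, iters=1, seed=1))     # one randomization
print(mean_observed_otus(counts, size=10, iters=1000, seed=1))  # averaged
```

The single-draw value is always a whole number that changes from seed to seed, while the averaged value settles toward a stable expectation as `iters` grows.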

Pat

EllistonV · May 10, 2025, 5:49am

Alright, thank you very much!

