subsampling

jitendrakeshri · October 12, 2016, 2:11pm

Please suggest whether alpha diversity is also to be calculated on subsampled data?

Kendra · October 12, 2016, 2:50pm

yes, pick a number of sequences and subsample to that for both alpha and beta indicies

stevewhitemd · October 13, 2016, 5:35pm

Hi

I’ve been told the same by several of my microbiome colleagues and that is what I do.

The question for me: what value to pick?

Recent example: I have ~40 patient samples, and to that run added 10 reagent controls (blanks, etc). The nseqs for the patient samples ranged from 9,500 to 30,000. The highest nseqs for my reagent controls (I have a great tech ) was 400. So it was easy for me, in calculating alpha and beta indices, to set subsampling to 9500.

Is that right? And what would be a good rationale for setting the value?

Kendra · October 14, 2016, 2:50pm

I like to go a bit below my lowest sample. Because subsampling the sample that has 9500 to 9500 is different than subsampling the 30000 to 9500. I’d probably have gone with 7500. But this is probably a really minor point.

I’m still trying to figure out what to do with the negative controls, for now I’m processing them with the rest of the samples but they get dropped when subsampled (which is ok for me. clients have the data, they can see what OTUs show up in the neg and decide what to do with that information)

jitendrakeshri · October 20, 2016, 2:56pm

Thanks kmitchell for informative reply.

Topic		Replies	Views
Subsampling 3 different groups at same level Theory behind mothur	5	1666	May 31, 2018
sub.sample - upper limit Commands in mothur	6	4086	June 6, 2013
Relplication of sub.sample Commands in mothur	7	5066	November 8, 2012
tips on subsampling, feature request? Theory behind mothur	5	5299	February 4, 2014
Normalizing sequences in each sample Commands in mothur	8	7770	January 9, 2015

subsampling

Related topics