sub.sample - question about "size" and "persample"

Commands in mothur

poroyko April 4, 2013, 4:53pm 1

Good day!
I am working with sequences created in MiSeq SOP by make.contigs command.
For my data I would like to make smaller FASTA and group files with 10,000 reads per each barcode using:

sub.sample(fasta=xxxx.trim.contigs.fasta, group=xxxx.contigs.groups, size=10000, persample=true)
I am a little pazelled with “size” and “persample” options. Is the command line above correct?

Another question. Several barcodes are having read counts below 10K. How “sub.sample” treats “<10K” cases?

collect all available reads to the new file
reject the entire group

Thank you in advance!

pschloss April 4, 2013, 5:35pm 2

sub.sample(fasta=xxxx.trim.contigs.fasta, group=xxxx.contigs.groups, size=10000, persample=true)
I am a little pazelled with “size” and “persample” options. Is the command line above correct?

That should work to get you 10000 sequences per group.

Several barcodes are having read counts below 10K. How “sub.sample” treats “<10K” cases?

It will reject the entire group.

Topic		Replies	Views	Activity
sub.sample command Commands in mothur	2	2313	July 13, 2012
sub.sample feature Commands in mothur	2	3908	January 10, 2011
sub.sample Commands in mothur	1	1678	January 21, 2015
Which command can subsample fasta file Commands in mothur	3	2893	February 16, 2014
How to determine size for sub.sample Commands in mothur	1	1581	March 30, 2015