I am getting this result after doing mothur > summary.single(shared=stability.an.shared, calc=nseqs-coverage-sobs-invsimpson, subsample=14272) [14271 is the no. of sequence of the smallest samlple]
You don’t want to run summary.single on subsampled data. The reason the std (standard deviation) is zero is because there’s no variation since all of the samples have the same number of reads.
Do you mean we should remove the subsample=# option from summary.single command? Or using the original (before subsampled) .shared file not the .subsample.shared file?
The sub.sample command will generate a separate .subsample.shared file and the original .shared file is not modified, am I right?
Also, how informative those “std” values are? Can I just use the “ave” values to make a table and put it in my dissertation? Thanks!
If I understood correctly, in mothur > summary.single(shared=stability.an.shared, calc=nseqs-coverage-sobs-invsimpson, subsample=2441), 2441 is the total no. of sequences of the smallest sample in SOP.
But you wrote: " You don’t want to run summary.single on subsampled data". Why so ?
Are you suggesting that I should not run summary.single on subsmapled data, if I have to work with Standard deviation values ??
But you wrote: " You don’t want to run summary.single on subsampled data". Why so ?
Are you suggesting that I should not run summary.single on subsmapled data, if I have to work with Standard deviation values ??
Sorry, I’m lost. you just quoted what I said and asked if that was what I said. The input to summary.single should be the file that is generated by make.shared and should not be subsampled prior to running summary.single.