Remove singletons

flavobacteria · May 24, 2016, 1:04pm

I am trying to remove singletons and go from there for alpha and beta diversity analysis. Can anyone advise which step this command should be inserted?
I did these:
Following through Mothur SOP,
mothur > sub.sample(shared=stability.an.shared, size=6105)
Sampling 6105 from each group.
0.03

Output File Names:
stability.an.0.03.subsample.shared

mothur > remove.rare(shared=stability.an.0.03.subsample.shared, nseqs=1) 0.03

Output File Names:
stability.an.0.03.subsample.0.03.pick.shared

mothur > count.groups(shared=stability.an.0.03.subsample.0.03.pick.shared) Total seqs: 207161.

Output File Names:
stability1.an.0.03.subsample.0.03.pick.count.summary

Get the minimal number of OTUs that were present in all of these samples, then go from there for :
collect.single(shared=stability1.an.0.03.subsample.0.03.pick.shared, calc=chao-invsimpson, freq=100)

Does this insertion make senses? I mean, if I want to compare the results with and without the removing the singletons.

Soon · May 24, 2016, 2:04pm

For me, I remove them after clustering and before making shared file:

cluster(column=current, count=current)
remove.rare(list=current, count=current, nseqs=1, label=0.03)
make.shared(list=current, count=current, label=0.03)
classify.otu(list=current, count=current, taxonomy=current, label=0.03)

It works so far.

Soon

westcott · May 24, 2016, 2:21pm

You might also like the filter.shared command, http://www.mothur.org/wiki/Filter.shared.

flavobacteria · May 25, 2016, 12:31am

Thank you!
The taxonomy=current does not work (see below): if I use the one produced before removing singletons, will it messes up the classification?

mothur > classify.otu(list=current, count=current, taxonomy=current, label=0.03)
Using stability1.trim.contigs.good.unique.good.filter.unique.precluster.uchime.pick.pick.pick.count_table as input file for the count parameter.
Using stability1.trim.contigs.good.unique.good.filter.unique.precluster.pick.pick.an.unique_list.0.03.pick.list as input file for the list parameter.
[WARNING]: no file was saved for taxonomy parameter.
You have no current taxonomy file and the taxonomy parameter is required.
reftaxonomy is not required, but if given will keep the rankIDs in the summary file static.
[ERROR]: did not complete classify.otu.

Thank you again!

Soon · May 26, 2016, 12:38am

I used the most current taxonomy file in the workflow, which is after the remove.lineage() for me. That was before removing the singletons, so it should be fine.

pschloss · May 28, 2016, 10:41am

FWIW, I strongly discourage the removal of singletons for alpha and beta-diversity analysis. As in, if I got your manuscript, I would raise a red flag. All of the metrics are dependent on the distribution of sequences. Removing singletons will disproportionately affect samples with higher sequencing depths. Instead, the better choice is to rarefy your data to a common number of sequences.

Pat

Topic		Replies	Views
Removing singletons (yeah, I know) Commands in mothur	4	4928	October 5, 2015
split.abund and several samples Commands in mothur	1	1146	August 29, 2016
Rare and abundant OTU Commands in mothur	3	4766	November 11, 2014
Remove singleton OTU-s Commands in mothur	1	6663	February 2, 2012
singletons Commands in mothur	1	745	November 10, 2017

Remove singletons

Related topics