cluster vs cluster.split with many unclassifieds

FM_Kerckhof · February 19, 2015, 9:07pm

Hi,

I am currently processing amplicon sequencing data (Illumina MiSeq) of piezophilic organisms cultivated with long-chain alkanes under very high pressure (up to 200 bar). The original inoculum came from a deep-sea core sample, so it seems likely to me that not all 16S sequences can be classified, even using the RDP trainset 10 (pds version).
I am inclined not to kick out these “unknowns”: although I don’t want to include them in any elaborate analysis, I would like to see whether they relate to my design, i.e. whether more “unknown” 16S sequences occur at higher pressures (incubations were done at different, increasing pressures).
So my question is how cluster.split handles “unknown” sequences, and whether it is better in this case to use the regular cluster command.
If I am making a major mistake here by not kicking out the unknowns when I should, please let me know. I am aware that this is not necessarily a proxy for unknown diversity and could also be sequencing errors, but if there is a systematic correlation with the experimental design, this might be an indication that we are looking at some unknown stuff as we increase the pressure, no?

Thanks in advance for your input.

pschloss · February 20, 2015, 1:45pm

An unknown is a read that doesn’t classify at the level of kingdom. I wouldn’t trust them, but you can always kick them out later. These would form their own group in the splitting algorithm and be clustered separately from everything else. We generally see these when we get non-specific amplification products that somehow make it through the pipeline.
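
To make the "own group" point concrete, here is a rough Python sketch of splitting by classification. It is not mothur's actual implementation, only an illustration of the idea. The function names (`parse_lineage`, `split_by_taxonomy`), the toy sequences, and the exact lineage strings are made up for the example. The `Name(confidence);` lineage layout is assumed to resemble a mothur taxonomy file.

```python
import re
from collections import defaultdict


def parse_lineage(lineage):
    """Turn 'Bacteria(100);Firmicutes(98);' into ['Bacteria', 'Firmicutes']."""
    ranks = [r for r in lineage.strip().split(";") if r]
    # Drop a trailing confidence score such as '(100)' from each rank name.
    return [re.sub(r"\(\d+\)$", "", r) for r in ranks]


def split_by_taxonomy(taxonomy, level):
    """Group sequence names by their lineage down to `level` (1 = kingdom).

    Reads whose first rank is 'unknown' all land in one shared group,
    so they would only ever be clustered with each other.
    """
    groups = defaultdict(list)
    for name, lineage in taxonomy.items():
        ranks = parse_lineage(lineage)
        if ranks and ranks[0].lower() == "unknown":
            key = "unknown"
        else:
            key = ";".join(ranks[:level])
        groups[key].append(name)
    return dict(groups)


# Hypothetical toy data, not real output from classify.seqs.
taxonomy = {
    "seq1": "Bacteria(100);Proteobacteria(100);Gammaproteobacteria(99);",
    "seq2": "Bacteria(100);Proteobacteria(100);Alphaproteobacteria(95);",
    "seq3": "Bacteria(100);Firmicutes(98);Clostridia(90);",
    "seq4": "unknown(100);unknown_unclassified(100);unknown_unclassified(100);",
}

groups = split_by_taxonomy(taxonomy, level=2)
for key, names in sorted(groups.items()):
    print(key, names)
```

Each group would then be clustered independently, so keeping the unknowns does not change how the classified reads are binned into OTUs.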
