cluster.split

jdf · January 13, 2015, 1:42pm

Hi there,
I tried to run cluster.split on my MiSeq data but could not complete it because the temporary files took up approximately 21 TB. I checked how many sequences I was trying to cluster: 929,618 unique out of 6,416,820 total, down from approximately 26 million raw reads at the start of my analysis. Someone must have encountered this problem before. Any insight into what I can do?

Thanks in advance,
Jessica

pschloss · January 16, 2015, 4:05pm

I think this will explain what’s going wrong and perhaps some paths forward…

http://blog.mothur.org/2014/09/11/Why-such-a-large-distance-matrix%3F/

Pat
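
As a rough illustration of the scale involved, the number of pairwise distances among N unique sequences is N(N-1)/2. The sketch below works that out for the 929,618 unique sequences mentioned above. The `bytes_per_distance` values are purely hypothetical placeholders, not measured mothur figures, and a distance cutoff would reduce how many distances are actually written.

```python
def pair_count(n_unique: int) -> int:
    """Number of distinct sequence pairs among n_unique sequences: N(N-1)/2."""
    return n_unique * (n_unique - 1) // 2


def estimated_size_tb(n_unique: int, bytes_per_distance: float) -> float:
    """Rough size in TB (10**12 bytes) if every pairwise distance were stored.

    bytes_per_distance is an assumed average cost per stored distance,
    not a value taken from mothur.
    """
    return pair_count(n_unique) * bytes_per_distance / 1e12


if __name__ == "__main__":
    n = 929_618  # unique sequences reported in the question
    print(f"pairs: {pair_count(n):,}")
    # Hypothetical per-distance costs, for illustration only
    for b in (10, 25, 50):
        print(f"{b} bytes/distance: about {estimated_size_tb(n, b):.1f} TB")
```

With several hundred billion possible pairs, even a few tens of bytes per stored distance lands in the terabyte range, so reducing the number of unique sequences has a quadratic effect on the size.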
