clustering time grows linearly with .dist file size, right?

Fabian · September 26, 2014, 8:57am

Hi,

Please remind me whether this is correct: Clustering time grows quadratically with the number of sequences, but linearly with the size of the distance file. This is because the size of the distance matrix already grows quadratically with sequence number, right?

I am talking “cluster” command here.

Thanks,
F

pschloss · October 2, 2014, 11:53am

I think that should be correct for cluster.classic. Although, we do cluster differently since we turn everything into a sparse matrix. I think it might actually be a bit faster and definitely uses less memory.

Fabian · October 7, 2014, 4:28pm

Thank you!

Topic		Replies	Views
Cluster() Theory behind mothur	3	5216	August 24, 2010
Segmentation fault when clustering a 1.44 GB dist file mothur bugs	5	135486	November 14, 2009
Command cluster_issue	17	890	November 28, 2021
Clustering a 10GB distance matrix mothur bugs	2	4197	March 16, 2011
Issues with cluster command Commands in mothur	5	4453	December 19, 2012

clustering time grows linearly with .dist file size, right?

Related topics