pre.cluster only uses 1 processor

ch3coch3 · September 19, 2017, 6:17pm

Hi,

I’ve been running into this situation with one particular data set for a while: it takes more than 10 days and still counting to run pre.cluster command in a data set with 12 samples. I tried the local workstation and HPC in our university, same slow process. I used mothur 1.39.0

The data set targets ITS1 for fungal community survey. All the parameters and commands I used for this data were the same I used before (also ITS1 for fungal community). Previously all the commands ran through without a problem (used mothur 1.38.1)

One of the potential explanations might be the fact that pre.cluster command only used 1 core in HPC node according to HPC log file, even though it says “using 12 processors” when pre.cluster was running.

What would be the solution for the slow process, please?

Thank you so much!

westcott · September 25, 2017, 1:43pm

The number of processors can vary as the command processes. This is because for some parts of the command only one processor is running, while in other areas all 12 will be used. How many sequences are in your dataset?

ch3coch3 · September 25, 2017, 3:58pm

It has 1,357,056 raw seqs, 930,737 in xx.trim.contigs.good.fasta, and 211, 068seqs in xx.trim.contigs.good.unique.fasta

There are 12 samples in the dataset

Topic		Replies	Views
pre.cluster use of processors parameter Commands in mothur	3	2675	November 18, 2013
pre.cluster taking a long time mothur bugs	8	2678	May 10, 2017
Pre.cluster is taking forever mothur bugs	1	398	March 21, 2022
Pre.cluster not working and quit mothur mothur bugs	5	1158	July 30, 2019
Pre.cluster hanging on sample processing Commands in mothur	2	598	June 17, 2019

pre.cluster only uses 1 processor

Related topics