MacPro and mothur

Jatinder · August 12, 2014, 6:44pm

I am using a Mac Pro (3.5 GHz 6-core, 64 GB RAM) to analyze MiSeq data (17 GB). The chimera.uchime command took more than 52 hours, and cluster.split has now been running for 108 hours and still has not finished. Is this a normal amount of time, even with this much computing power? Is there a way to speed it up? I have four times this much data still to analyze.

Thanks

pschloss · August 13, 2014, 12:11pm

What type of MiSeq data? V4?

Jatinder · August 13, 2014, 12:49pm

Thank you for the reply. The data are from the V34 region. Also, the command is still running; it has not completed yet.

pschloss · August 13, 2014, 2:36pm

You cannot get a decent error rate sequencing the V34 region since the reads will not fully overlap. The upshot is that you’ll get a high error rate, resulting in a large number of unique reads. You’re likely to be stuck doing the phylotype-based approach or going back and sequencing the V4 region.

Pat
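
To put rough numbers on the overlap point above, here is a minimal sketch. The amplicon lengths (about 253 bp for V4, about 460 bp for V34) and the 250 bp read length are assumptions chosen as typical values; the thread does not state them, and `overlap_fraction` is a hypothetical helper, not part of mothur.

```python
def overlap_fraction(amplicon_len: int, read_len: int) -> float:
    """Fraction of the amplicon covered by BOTH the forward and reverse read."""
    # A read cannot cover more bases than the amplicon contains.
    covered = min(read_len, amplicon_len)
    # Forward read covers [0, covered); reverse read covers
    # [amplicon_len - covered, amplicon_len). Their intersection:
    overlap = max(0, 2 * covered - amplicon_len)
    return overlap / amplicon_len


# Assumed values, not taken from the thread.
READ_LEN = 250
AMPLICON_LEN = {"V4": 253, "V34": 460}

for region, length in AMPLICON_LEN.items():
    frac = overlap_fraction(length, READ_LEN)
    print(f"{region}: {frac:.0%} of positions are read twice")
```

Under these assumptions nearly every V4 position is sequenced twice, so disagreements between the two reads can be resolved, while most V34 positions are covered by only one read and errors there go uncorrected.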

Jatinder · August 13, 2014, 5:05pm

Thanks, Pat. If I understand correctly, you are suggesting that I run the phylotype command instead of cluster.split and use the shared file generated from there for further analyses? Also, is there a way to speed up the processing and use all the computing power I have available?

Jatinder

pschloss · August 15, 2014, 12:44am

The best way to speed it up would be to sequence the V4 region.
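
A rough sketch of why the number of unique reads matters so much for runtime. It assumes the worst case, where every unique read is compared against every other; cluster.split partitions the data first, so the real count would be lower. The read counts are made-up illustrations, and `pairwise_comparisons` is a hypothetical helper.

```python
def pairwise_comparisons(n_unique: int) -> int:
    """Number of distinct pairs among n_unique sequences: n * (n - 1) / 2."""
    return n_unique * (n_unique - 1) // 2


# Illustrative counts only; none of these come from the thread.
for n in (10_000, 100_000, 1_000_000):
    print(f"{n:>9,} unique reads -> {pairwise_comparisons(n):,} pairs")
```

Because the pair count grows with the square of the number of unique reads, cutting the error rate (and with it the number of spurious unique reads) helps far more than adding cores.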
