Troubles with clearcut in MiSeq SOP

JIngermann · November 16, 2017, 9:19am

Dear mothur community,

I am a PhD student in a lab with no bioinformatics experience at all but ended up with a 16s dataset I’m supposed to analyse. I am working my way through the MiSeq SOP using Amazon Web Services and the 1.3.95 AMI. I did all steps using the r4.xlarge with 30.5 gb of RAM using 4 processors until I ran dist.seqs as described under the phylogenetics header. The resulting .dist file was ~40 gb in size (guess who ended up with the 2014 blog entry as well). So I changed the system to r4.2xlarge with 61 gb RAM, since I thought that should be able to read in the matrix and proceed with creating the .tre.
The issue now is: clearcut seems to stop working. I’m running mothur in screen and normally go back to the terminal to monitor RAM usage by “top”. For a while I can see RAM usage increasing, until it stops at the same point in all runs and then nothing happens anymore (I checked this for several hours).

Does anyone know why or how this happens? Could there be something corrupted in my .dist file that stops the reading process? Or is the .dist file simply too large for the clearcut program? Any ideas would be highly appreciated, since right now I’m stuck and have no idea how to proceed.

Best

Jonas

pschloss · November 17, 2017, 1:43pm

Hi Jonas,

I’d skip clearcut. We have yet to see results from OTU-based analyses conflict with those of a phylogenetic analysis. As you’re seeing the distance matrix that is produced is so large that it’s just not possible.

Pat

JIngermann · November 20, 2017, 12:35pm

Hi Pat,

thanks for the info. I’ll try to move on with the OTU based approach.

Best

Jonas

Topic		Replies	Views
problem with clearcut mothur bugs	23	19949	December 21, 2015
Problem of Clearcut Commands in mothur	3	3876	March 24, 2016
Issues with clearcut Commands in mothur	2	457	July 22, 2021
clearcut Commands in mothur	1	1607	June 23, 2016
clearcut terminates abruptly mothur bugs	1	1225	January 25, 2017

Troubles with clearcut in MiSeq SOP

Related topics