RAM needed for cluster.split

Hi mothur forum,
mothur is really helpful but how to know when the pc can’t handle cluster.split?
I’m running it with column, count, taxonomy and splitmethod classify. The distance matrix is ca 70 GB. The largest temp file is ca half that size now. I wonder if my pc (standard 8 GB RAM) can handle this or I have to use a fasta file with cluster.split instead, or perhaps cluster using phylotypes.

It would likely run much faster if you gave it a faster file rather than a distance matrix. That being said, 8 GB isn’t much and windows always seems to run slower than running it on a Mac or Linux computer. Can you run it on a high performance computer cluster at your institution or on Amazon?


Thanks, yes I could try a high performance computer cluster. But before doing that, is there a rule of thumb to indicate RAM needed? For example, how much RAM does the PC have to be equipped with to process a 10, 50 or 100 GB distance matrix?

Sorry, but it’s hard to say. I first would try running it with a fasta file and see what happens. I just noticed you said you had a 70GB distance matrix, which is quite large and I worry that it might not be possible to run through cluster. See this blog post for a partial explanation and some thoughts