chimera.uchime

bhassett · February 12, 2015, 2:20am

Hi Friends,

Running the chimera.uchime command on a larger file (~1.2million seqs), using my countfile as a reference. To date, the command has been running >1 month on an 18S rRNA data set. Wondering if anyone has experienced any comparable runtimes or if anyone’s processing rate continuously decreases as the file nears completion (30seqs/sec at start versus 1seqs/7secs at 97% completion).

Many thanks!

pschloss · February 16, 2015, 2:27pm

Whoa, that’s a long time. How many processors did you set? How long is the region in the 18S gene that you’re sequencing? What sequencing platform are you using?

bhassett · February 17, 2015, 1:45am

Hey Pat,

That particular dataset was running on 8 processors, but I’ve experienced similar runtimes using 32.

~450 bps from a 2x250 MiSeq run.

Sure takes a long time, but the data look great.

pschloss · February 20, 2015, 1:35pm

The problem is likely because your reads do not fully overlap (I know this is hard to design for 18S) and so you have a high error rate, which effectively inflates the number of unique sequences and makes everything take longer and more RAM. See:

http://blog.mothur.org/2014/09/11/Why-such-a-large-distance-matrix%3F/

Also, just to be clear, you’re giving it a count or group file, right? Any sense how many groups it has processed?

Related to the blog post above, I suspect that even if it will go through chimera.uchime you won’t be able to form OTUs. It will likely be necessary to do a phylotype-based approach.

Pat

Topic		Replies	Views
long processing time chimera.uchime? mothur bugs	4	1275	March 14, 2017
chimera.uchime running for ever Integrating mothur with other programs	1	1594	May 11, 2017
suggestions for large files for uchime de novo Commands in mothur	2	2079	March 13, 2015
CHimera uchime Commands in mothur	1	1812	July 19, 2013
Uchime Commands in mothur	4	2153	October 19, 2015

chimera.uchime

Related topics