Does anyone know of a good estimate for analysis times for functions based on number of sequences and sequence length? I’m paying for server time to do some analyses and I was wondering if there are any calculators for the length of time an analysis may take? I know the variables involved on the server side would be CPU speed , memory, and OS but there may be others.
So for example, if I was going to create a distance matrix of 500,000 sequences I would assume that with a given processor on a given OS and with a given amount of memory, it may take say 10 seconds per 100 bp per sequence to calculate each pairwise distance. You could then plug in your average sequence length and the total expected analysis time would be a permutation of this?
I’m particularly interested in being able to calculate analysis times for align.seqs, dist.seqs, and cluster.