Analyzing hundreds of datasets simultaneously

peleliu · July 24, 2012, 9:17am

Hi,
After working with MOTHUR using my own samples, I would like to try and work with 454 datasets available on the web (NCBI SRA, MG-RAST etc.). I have downloaded ~200 454 datasets; I could go through the Schloss SOP 200 times and then compare the results, but that would take a long time. Is there some way that can allow to do them all simultaneously? Would merging all of them into a single dataset cause memory problems? If not merging, perhaps there is a script that allows to run the 200 datasets one after the other automatically…?
Thanks!

Kendra · July 24, 2012, 5:18pm

I’m not a programer but have managed to run >500 separate datasets through the first few steps of the SOP using “for” and command line mothur

http://www.mothur.org/wiki/Command_line_mode

peleliu · July 24, 2012, 5:46pm

Interesting! This is immensely helpful.
Can you please post your entire script?
How are your datasets organized?
Thanks again…

Kendra · July 24, 2012, 6:43pm

this is it so far, my files are all in the folder that I’m in before running this. Also my bacterial sff start with B and euk E, hence the separate oligos files and commands. I’m kind of making this up as I go along, so no flames if it doesn’t work for you

for n in .sff; do mothur “#sff.info(file=$n, flow=T)”; done
for n in B.flow; do mothur “#trim.flows(flow=$n, oligos=B.oligos, pdiffs=2)”; done
for n in E*.flow; do mothur “#trim.flows(flow=$n, oligos=E.oligos, pdiffs=2)”; done
for n in *.flow; do mothur “#shhh.flows(flow=$n, processors=2)”; done

peleliu · August 2, 2012, 7:29am

Thanks a lot.
Pat, any other thoughts will be much appreciated…

pschloss · August 2, 2012, 12:05pm

About all I can recommend is to ask your sequence provider to not split everything up like this

Topic		Replies	Views
Too much data, a way to combine outputs? Commands in mothur	1	1821	December 11, 2013
Data import Commands in mothur	4	5426	May 3, 2013
454SOP Commands in mothur	3	1222	March 7, 2016
how to handle several ssf files Commands in mothur	2	2119	January 23, 2014
Mothur for large amount of data Feature requests	7	6787	September 26, 2013

Analyzing hundreds of datasets simultaneously

Related topics