Merge .fasta files for OTU counting

adrian.lopez · January 25, 2019, 1:10pm

Hello!
I’m new to the forum, so I apologise if my question is duplicated.
I performed a 16S analysis of 192 samples following the available MiSeq SOP, but I made it separately for each sample instead of using the make.file command.

For this reason when I try to use the dist.seqs + cluster commands for OTU clustering the OTU numbers for each .fasta file are not correlative among samples.

Is there a way to merge all my filtered and aligned .fasta files keeping a sample codification in order to obtain an OTU table with the number of reads for each OTU for all samples?

I read about merge.files but I’m not sure if this command will create a groups file keeping codes for each sample…

Any help would be greatly appreciated.

Adrian

pschloss · January 28, 2019, 9:59pm

You should merge the files after running make.contigs and then take them through the rest of the pipeline. Otherwise, the alignments will be out of whack after running filter.seqs on the files separately.

adrian.lopez · January 29, 2019, 7:26am

But processing sequences from 192 samples within the same file (including heavy steps as pre.cluster or chimera.uchime) seems computationally expensive to me…
Is there any way to split the merged file and parallelize this pipeline? (At least for some steps)

pschloss · February 5, 2019, 12:28am

The individual step are parallelized. Also, by processing them together, we take advantage of the redundancy across samples to get further speed up.

adrian.lopez · February 13, 2019, 9:37am

Great. And should I use merge.files for that? Will this keep a sample identification on merged file? Or is it better to use another command?
Thank you very much for your time!

pschloss · February 19, 2019, 4:16pm

Sorry - why aren’t you using the files option in make.contigs? I think that would make your life so much easier. Alternatively, you can concatenate the files as you please

system · March 1, 2019, 4:16pm

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Multiple fasta files Commands in mothur	5	4159	May 9, 2014
Using preprocessed merged reads Commands in mothur	4	1428	March 27, 2017
Merging data sets Commands in mothur	4	4175	February 2, 2012
Combining multiple fasta files for subsequent analysis Commands in mothur	1	1362	March 20, 2016
Merge duplicate MiSeq runs Commands in mothur	4	5271	August 16, 2013

Merge .fasta files for OTU counting

Related topics