Optimal point for merging several 454 runs after sff?

cbharder · March 21, 2014, 4:40pm

I have 454 data from 120+ patients spread across 6 separate 454 runs. All runs use the same 32 barcodes, so merging the sff files is not an option. They represent 6 nominal groups. I wish to combine them in two ways: 1) simply combining the runs and making a shared file covering all of the individual patients, and 2) merging the patients within each nominal group and making a shared file for each nominal group. My standard workflow uses trim.seqs for quality trimming, removes singletons, aligns against a subsample (checked with BLAST), and removes the unalignable sequences. I will then proceed to clustering, making the shared file, rarefaction, classification, etc.

My question is: at what point is it most appropriate to do the merging? And if one merges the group, name and fasta files using the merge.files command, is the information automatically retained in a way that makes it possible to subsequently run dist.seqs and make.shared on the complete set of 6 454 runs?

pschloss · March 28, 2014, 4:30pm

You should merge after trim.seqs. Also, you can do this with sff.multiple.
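
As a rough sketch of that order of operations: each run is trimmed on its own with its own oligos file, and the outputs are then concatenated with merge.files. The file names below (run1.fasta, run1.oligos, all.trim.fasta, and so on) are placeholders, only two of the six runs are shown, and the exact trim.seqs parameters and output file names depend on your own workflow and mothur version.

```
# Trim each run separately with its own oligos file, so the reused
# barcodes are resolved to that run's sample names (placeholder file names)
trim.seqs(fasta=run1.fasta, qfile=run1.qual, oligos=run1.oligos)
trim.seqs(fasta=run2.fasta, qfile=run2.qual, oligos=run2.oligos)

# Concatenate the trimmed reads and the group assignments from all runs
merge.files(input=run1.trim.fasta-run2.trim.fasta, output=all.trim.fasta)
merge.files(input=run1.groups-run2.groups, output=all.groups)
```

This assumes the sample names in the oligos files are unique across runs. If two runs reuse a sample name, reads from different patients would end up in the same group after merging.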

Pat
