Processing 454, Illumina, and Sanger Data

group2 · April 24, 2012, 7:00pm

Hello,

I’m currently part of a computer science bioinformatics class at a state university. In this class we’ve been split into different groups, with different goals. Our group was tasked with comparing two different microbial community software packages. One of these software packages is mothur. Furthermore what we’re to do with mothur is that we’re to take sequence read data that was produced by a 454, Sanger, and Illumina machine, and process it through mothur via the steps that will allow for OTU picking, alpha diversity, and finally beta diversity.

This data has already been demultiplexed.

Our group has tried several times to follow the workflows provided by the analysis examples, but have had no luck for any of these data sets.

From our understanding we believe that in order to perform OTU picking our workflow is:

align.seqs
filter.seqs
dist.seqs
cluster

Then to perform the alpha diversity analysis we would just use:

rarefaction.single

And then finally we would perform our beta diversity analysis using:

rarefaction.shared

Is this the correct workflow that we should be doing? If not could someone elaborate on what the correct workflow should be as well as provide some brief reasoning.

We believe one of the reasons we’ve been having issues with our data is that the sequences haven’t been cleaned up to account for different read lengths. When investigated we found that there were some processes like screen.seqs, and chop.seqs. However, with our preliminary testing we’ve had no luck in getting different results after applying these.

Is there a specific nuance to these steps that we may be forgetting to account for?

Lastly we’ve seen in a few of the steps that there can be a name file provided as well. Is there a process to extract a name file from some step of the output, because as it stands we haven’t created an artifact that represents the name file.

We appreciate any input that anyone has for our questions.

westcott · April 24, 2012, 7:09pm

You may want to take a look at http://www.mothur.org/wiki/Schloss_SOP.

Topic		Replies	Views
Cluster sequence into OTUs Theory behind mothur	23	15372	January 9, 2015
Problem Qiime/mothur different results Integrating mothur with other programs	5	5939	August 4, 2015
Normalizing sequences in each sample Commands in mothur	8	7744	January 9, 2015
Unique nseq & a lot of "Bacteria; unlcassified" Commands in mothur	1	2418	March 30, 2015
Clustering at 98% identity threshold level Commands in mothur	2	830	August 9, 2017

Processing 454, Illumina, and Sanger Data

Related topics