phylogenetic tree with data from several regions of the 16S

mbakker · January 10, 2013, 10:17pm

Is it possible to use phylogenetic diversity measures on data that include sequences from several regions of the 16S gene (e.g., if they’ve been summarized/combined as a single taxonomy file)? The programs for building massive trees seem to require aligned sequences, but it seems like you ought to be able to map directly from a taxonomy file to branches on an already-built tree (e.g., one derived from an external database). Has anyone done this?
Thanks!

pschloss · January 11, 2013, 12:49pm

Yeah, Rob Knight’s group has done this. I personally think this is fraught with all sorts of problems. Problem #1 is that all primer sets have different biases.

flavio · September 22, 2014, 11:31pm

What if I wanted to do it anyway, how would I do that? Has Rob Knight published that?
Thanks.

dwaite · September 23, 2014, 12:22am

It sounds exactly like closed-reference OTU picking in QIIME. It’s here, just use one of the ‘_ref’ methods.

As Pat says though, this isn’t a great method to use without serious justification.
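
For anyone wondering what "mapping onto an already-built tree" amounts to once reads have been assigned to reference IDs, here is a minimal sketch in plain Python. It is not QIIME's or mothur's implementation. The tree, the tip names, the `faith_pd` helper, and the two hit sets are all hypothetical, and it assumes the reference tree is available as a child-to-parent table with branch lengths. It computes Faith's phylogenetic diversity as the total branch length on the union of tip-to-root paths, and it does nothing about the primer bias problem raised above.

```python
def faith_pd(parent, observed_tips):
    """Faith's PD: total branch length on the union of tip-to-root paths.

    parent maps each non-root node to (parent_node, branch_length).
    observed_tips is the set of reference tips that reads were assigned to.
    """
    visited = set()
    total = 0.0
    for tip in observed_tips:
        if tip not in parent:
            raise KeyError(f"tip {tip!r} is not in the reference tree")
        node = tip
        # Walk toward the root, counting each branch only once.
        while node in parent and node not in visited:
            visited.add(node)
            node, length = parent[node][0], parent[node][1]
            total += length
    return total


# Hypothetical reference tree: ((A:1,B:2)n1:0.5,(C:1.5,D:1)n2:0.25)root
REFERENCE_TREE = {
    "A": ("n1", 1.0),
    "B": ("n1", 2.0),
    "C": ("n2", 1.5),
    "D": ("n2", 1.0),
    "n1": ("root", 0.5),
    "n2": ("root", 0.25),
}

# Hypothetical closed-reference hits from two different 16S regions.
v4_hits = {"A", "B"}
v1v3_hits = {"B", "C"}

print(faith_pd(REFERENCE_TREE, v4_hits))              # 3.5
print(faith_pd(REFERENCE_TREE, v1v3_hits))            # 4.25
print(faith_pd(REFERENCE_TREE, v4_hits | v1v3_hits))  # 5.25
```

Because every region is reduced to tips on the same tree, no alignment across regions is needed. The catch is that any read failing to match the reference is simply dropped, and different regions may hit different tips for the same organism.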

pschloss · September 23, 2014, 1:43pm

Their own paper seems to indicate that it’s pretty worthless:

ncbi.nlm.nih.gov

Meta-analyses of studies of the human microbiota.

CA Lozupone, J Stombaugh, A Gonzalez, G Ackermann, D Wendel, Y Vázquez-Baeza, JK Jansson, JI Gordon and R Knight, Genome research, Oct 2013

Our body habitat-associated microbial communities are of intense research interest because of their influence on human health. Because many studies of the microbiota are based on the same bacterial 16S ribosomal RNA (rRNA) gene target, they can, in principle, be compared to determine the relative importance of different disease/physiologic/developmental states. However, differences in experimental protocols used may produce variation that outweighs biological differences. By comparing 16S rRNA gene sequences generated from diverse studies of the human microbiota using the QIIME database, we found that variation in composition of the microbiota across different body sites was consistently larger than technical variability across studies. However, samples from different studies of the Western adult fecal microbiota generally clustered by study, and the 16S rRNA target region, DNA extraction technique, and sequencing platform produced systematic biases in observed diversity that could obscure biologically meaningful compositional differences. In contrast, systematic compositional differences in the fecal microbiota that occurred with age and between Western and more agrarian cultures were great enough to outweigh technical variation. Furthermore, individuals with ileal Crohn's disease and in their third trimester of pregnancy often resembled infants from different studies more than controls from the same study, indicating parallel compositional attributes of these distinct developmental/physiological/disease states. Together, these results show that cross-study comparisons of human microbiota are valuable when the studied parameter has a large effect size, but studies of more subtle effects on the human microbiota require carefully selected control populations and standardized protocols.

“Together, these results show that cross-study comparisons of human microbiota are valuable when the studied parameter has a large effect size, but studies of more subtle effects on the human microbiota require carefully selected control populations and standardized protocols.”

In other words, why bother?
