Taxonomy visualization

awil · April 24, 2020, 5:30pm

Hello,
I use OTUs method for 16S analysis. I have some problem about the input file which should be use to calculate taxonomy. Should I use shared file or subsample.shared file to calculate taxonomy in each group?
and then which one is better between mean or median for relative abundance calculation in each group. I found that some genus give different value between using mean and median.
Could you please suggest me?

Thank you very much.

svazquez · April 27, 2020, 2:21pm

I add to this question. I also wander about relative abundances… if we have the subsampled shared file, is there any difference in using the absolute abundances or relative abundances for alfa and betadiversity? In which cases it is advisable to use relative abundances and which is the most recommended way to calculate it? I was told TSS, but I read about several ways of scale and transform data and can´t make my mind about when I should use one or another or both, scale and transform?
Thank you for sharing your thoughts/criteria!

Kendra · April 27, 2020, 3:09pm

Use the subsampled shared file for all analyses. The difference between a sample having 100,000 reads and one that has 5,000 reads (using current PCR based sequencing) is purely technical, it does not reflect anything about the biology of the samples.

@svazquez what types of scaling and transformations are you thinking about doing? it’s easier to answer a specific case than give a general guideline.

svazquez · April 27, 2020, 3:51pm

Hi, thank you!
Firstly, I’d like to know if using the subsampled shared file is the same to work with absolute abundances (reads) or relative abundances (all in all, the samples will have the same amount of sequences, in a way it is scaled, isnt’t it?).
Then, if I am wrong and for some reason we should work for alfa and betadiversity with relative abundances, is the best way to scale all samples to normalize each sample on it’s total sum? or normalize in another way?
Then, either using absolute or relative abundances, to run betadiversity tests or constrained ordinations, should I clr transfrom the abundances? or any other transformation?
This point is one of my weakest, I could never be good at statistics
And if this goes in a way that does not apply anymore to this trhead sorry, I can open a new question
Thank you!!

awil · April 28, 2020, 12:17pm

Thank you very much for your suggestion @Kendra.
I just do the project about gut microbiome in human for the first time. I found that when I used mean for calculation, genus Escherichia for normal group (7.080%) has a little bit more relative abundance than disease group (7.000%). However, if I used median, genus Escherichia for normal group (0.875%) have less relative abundance than disease group (1.8%). This confuses me. However, I did not check them for statistical significance between them. I maybe check them again.

Kendra · April 28, 2020, 10:21pm

I don’t transform further. Use mothur to calculate alpha (summary.single) and beta (dist.shared) diversity measured because you can repeatedly subsample and get an average for each index.

system · May 8, 2020, 10:26pm

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
when subsample? - correspondence of shared and taxonomy file Commands in mothur	5	3453	January 24, 2017
subsample? normalize? relabund? Commands in mothur	1	900	March 13, 2018
sub.sample and normalize.shared Theory behind mothur	1	4892	April 2, 2013
Issues on sub.sampling mothur bugs	5	8130	June 14, 2014
Transforming OTU data into sample counts Commands in mothur	3	908	May 18, 2017

Taxonomy visualization

Related topics