How it is possible to perform multivariate analysis when the number of variables (=taxa) are much higher than samples?

_renh · August 18, 2017, 11:24am

Hello mothur’s users!

I followed the MiSeq SOP pipeline of mothur in order to preprocess and assign taxonomy of my environmental 16S rRNA amplicon datasets (V4-5 hyper variable regions).

Now, I’m doing downstream analysis. Particularly, I’m interested to know how similar are my samples from each other. To doing so, I built a distance matrix with the unweighted UniFrac metrics in order to plot the results using the principal coordinates analysis (PCoA) method. However, some doubts arose related with the few number of samples (n=9) that comprise my own dataset.

The specific question is: ‘How it is possible to perform multivariate analysis (through PCoA) when the number of variables (unweighted UniFrac metrics, based on presence/absence of OTUs between samples) are much higher than the number of observations/samples (n=9)?’

I'm a newbie in bioinformatics and, unfortunately, I'm not good at statistics. Can anyone shed some light here! Thanks in advance. Kind regards, @renh@

Kendra · August 22, 2017, 2:02am

first don’t do pcoa unless you know your underlying gradient is linear (I’ve yet to see a natural community that is). Use NMS instead, it only assumes monotonicity rather than linearity

second, you’re variables aren’t taxa for ordinations, they are samples. You’ve transformed your data into a dissimilarity matrix, so you have the same number of variables as you have samples.

_renh · August 22, 2017, 6:09pm

Thank you kmitchell.
Cheers,
@renh@

Topic		Replies	Views
PCoA analysis Commands in mothur	5	2532	August 18, 2016
Analysis of paired samples? Theory behind mothur	3	1831	October 20, 2016
Could I create PCoA Plot in Mothur program? Commands in mothur	4	1085	March 21, 2019
PCoA R-squared error? mothur bugs	1	3445	December 16, 2013
pca commands Commands in mothur	15	11044	May 3, 2013

How it is possible to perform multivariate analysis when the number of variables (=taxa) are much higher than samples?

Related topics