So I have a bit difficulty in understanding “cluster” and “classify.seq”.
“classify.seq”, let the user to assign their sequences to the taxonomy outline of their choices. so I assume basically “muthor” calculate the distance of each sequence to the taxonomy groups in the “.tax” files and decides what sequences should belong to which group.
In the other hand, there is “cluster”, which works on the distance matrix, and based on the distance matrix find out sequences that are close together and form clusters.
So in principle, either we have a set of OTUs and would like to see, what sequences belong to which, or we don’t have OTUs and would like to figure out potential OTUs.
here are my questions,
- Is this my understanding correct ?
- what is the measurement for calculating these distances ?
- is this distance matrix, symmetric ? looks like a correlation matrix ?
Thank you