similarity between sequences

elisaccp · August 2, 2012, 11:49am

Hi!
I am cultivating new organisms of Archaea and some of them seem to be the same. I would like to calculate if they’re 16S rRNA gene sequences have more than 97% of identity in order to make a first conclusion of these microorganisms been the same or not.
I know how to cluster sequences by their similarities according to cutoffs pre established. But I was wondering if there is one way to estimate the percentage of similarity between two sequences of 16S rRNA gene with mothur.
Thank you.

pschloss · August 2, 2012, 12:08pm

dist.seqs will give you a distance matrix (1-similarity) alternatively, running classify.seqs with an alignment and the distance method should give you distances to the closest match in the database.

elisaccp · August 2, 2012, 12:49pm

Thank you.
I will try that also. But I was wondering a way to calculate if two sequences of mine have more than 97% of identity. I need the number of similar nucleotides, not just the name of the closest match in a database.

pschloss · August 2, 2012, 1:58pm

If you use the distance based approach with knn=1 you’ll get the percent id.

elisaccp · August 2, 2012, 2:21pm

thank you

Topic		Replies	Views
Distance to nearest neighbor (classify.seqs) Feature requests	3	6511	July 23, 2010
OTU Percentage Similarity Related To Taxon Level Theory behind mothur	5	2839	June 2, 2017
Clustering sequences Theory behind mothur	2	2065	December 21, 2015
Does mothur get confused by different copies of 16S in the single species? Theory behind mothur	4	725	June 22, 2019
Distance matrix size advice Commands in mothur	7	14036	February 12, 2010

similarity between sequences

Related topics