Rep sequences are too similar

wuyun · August 3, 2011, 7:20pm

Hi, after generating the representative sequence for each OTU, I used all of these rep sequences to build a BLAST database.
I then randomly picked several rep sequences and BLASTed them against that database; the other rep sequences that BLAST returns are extremely similar to the query sequence.
For example, my rep sequences come from the 0.03 cutoff, but when I BLAST one particular rep sequence, it finds several other rep sequences showing 99% identity to it.
I know this problem arises from the clustering, but the average neighbor method is already the best available method to use.

I learned from the manual that you can classify the sequences and then combine the OTUs classified to the same genus or so. But the database used for classification is also limited, so I can’t get all my sequences classified at the genus level to do the comparison. Actually, that’s why we chose the OTU-based analysis.

Is there any way to bypass this issue, or any strategy to deal with it?
My ultimate goal is to design OTU-specific primers, but with such extremely high similarity between rep sequences, I cannot design any specific primers.
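
A minimal sketch of the check described above, assuming the rep sequences are already aligned to the same length. The function names, OTU labels, and toy sequences are hypothetical and not part of mothur; the identity here is computed over all aligned columns, so it may differ somewhat from both BLAST's local-alignment identity and mothur's own distance calculation.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent identity over aligned columns, skipping columns where both are gaps."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "-." and y in "-.":
            continue  # shared gap column carries no information
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def similar_pairs(reps, threshold=97.0):
    """Return (name1, name2, identity) for every pair at or above the threshold."""
    hits = []
    for (n1, s1), (n2, s2) in combinations(sorted(reps.items()), 2):
        pid = percent_identity(s1, s2)
        if pid >= threshold:
            hits.append((n1, n2, pid))
    return hits


# Toy aligned rep sequences (hypothetical, 40 columns each).
reps = {
    "Otu001": "ACGT" * 10,
    "Otu002": "ACGT" * 9 + "ACGA",
    "Otu003": "TTGCA" * 8,
}

for name1, name2, pid in similar_pairs(reps):
    print(f"{name1} vs {name2}: {pid:.1f}% identity")
```

Pairs flagged this way are the ones for which OTU-specific primers would be hard to design, since the two rep sequences differ at only a few positions.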

pschloss · August 8, 2011, 11:41am

Is that 99% similarity over the full length of the sequence or only part of it?

wuyun · August 8, 2011, 6:00pm

It is over the full length of the sequence.
If it were only over part of it, it would not be a problem.

pschloss · August 10, 2011, 4:25pm

Sorry, but I am having a really hard time trying to understand what you’re trying to ask/say…
