Unique sequence vs PCR duplicates

quantrix · May 20, 2015, 4:46pm

I apologize in advance if this is an obvious and dumb question.

What is the PRECISE difference between a PCR duplicate and a unique sequence?

I tried looking for it all over, but was unable to obtain a clear explanation. Now, my thought process is something like this.

A PCR duplicate is something that comes straight off the NGS instrument, i.e., you look for PCR duplicates in your raw FASTQ file.
Whereas a Unique sequence is …?

Suppose you have two reads:

Read 1 - ATCGCCCTA
Read 2 - ATCGCCCT

Would these count as TWO unique sequences? If so, what algorithmic THRESHOLD does mothur's unique.seqs use to call two sequences unique versus non-unique?
Thanks for the favor of a reply.

pschloss · May 25, 2015, 2:43pm

For two reads to be identical, they must have the same length and the same sequence. There is no similarity threshold, so unique.seqs would treat the two reads you presented as different sequences.
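
To make the rule concrete, here is a rough Python sketch of exact-match grouping. It is only an illustration of "same length and same sequence", not mothur's actual code; the function name `group_identical_reads` and the third read (`read3`) are made up for the example.

```python
def group_identical_reads(reads):
    """Group read names by exact sequence (same length AND same bases).

    `reads` is an iterable of (name, sequence) pairs. This is a hypothetical
    helper for illustration, not part of mothur.
    """
    groups = {}
    for name, seq in reads:
        # Exact string equality: a read that is one base shorter is a
        # different key, so it ends up in its own group.
        groups.setdefault(seq, []).append(name)
    return groups


reads = [
    ("read1", "ATCGCCCTA"),
    ("read2", "ATCGCCCT"),   # one base shorter than read1
    ("read3", "ATCGCCCTA"),  # hypothetical extra read, identical to read1
]

groups = group_identical_reads(reads)
for seq, names in groups.items():
    print(seq, len(names), ",".join(names))
```

With these three reads the sketch reports two unique sequences: read1 and read3 collapse together, while read2 stays separate because it is one base shorter.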
