sequence duplication during 454 SOP v.1.29

jrapp · February 5, 2014, 1:09pm

Hi,

I followed the 454 SOP up to the removal of chimeras and stopped there. I then ran deunique.seqs, split.groups and degap.seqs, since I wanted to compare mothur's taxonomic classification output with that of a different pipeline. When checking the output fasta files, I realized that some of the headers & sequences were duplicated. This affected only some sequences, e.g. 70 sequences in a sample of 3000, but it happened in all 12 of my samples. I have no idea at which step it happened. Do you have any ideas? Has this been reported before?

Thanks for any help and comments!

westcott · February 5, 2014, 2:35pm

Have you tried it with our current version?

jrapp · February 5, 2014, 2:50pm

No, I haven’t done that yet. I just wanted to know whether anybody else has encountered something similar. I guess I wouldn’t have noticed if I had followed the complete SOP. The original fasta file from the sequencing company contains no duplicate headers & sequences, of course.
Since it’s just a duplication, the overall picture of relative abundances etc. doesn’t change, but in total there are roughly 1300 sequences that were not present at the beginning of processing the data.
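
For anyone who wants to check their own files for this kind of duplication outside of mothur, here is a minimal sketch in Python. It only counts repeated header names in a fasta file; the function name `duplicated_headers` and the example sequences are made up for illustration, and it assumes the sequence name is the first whitespace-delimited token after `>`.

```python
from collections import Counter
import io


def duplicated_headers(fasta_lines):
    """Return {name: count} for sequence names that occur more than once.

    fasta_lines can be any iterable of lines, e.g. an open file handle.
    The name is taken as the first token after '>' (an assumption about
    the header layout, not something mothur guarantees).
    """
    counts = Counter(
        line[1:].split()[0]
        for line in fasta_lines
        if line.startswith(">") and len(line.strip()) > 1
    )
    return {name: n for name, n in counts.items() if n > 1}


# Hypothetical toy input: seq1 appears twice, seq2 once.
example = io.StringIO(">seq1\nACGT\n>seq2\nGGCC\n>seq1\nACGT\n")
dups = duplicated_headers(example)
print(dups)  # {'seq1': 2}
```

Running the same function on the fasta file after each processing step (with `open("your_file.fasta")` in place of the toy input) should show at which step the duplicates first appear.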

westcott · February 7, 2014, 3:05pm

If you post the exact commands you ran, I may be able to spot the issue.
