Seq.error output strange

qinglong · November 12, 2020, 7:52pm

Hi Pat

mothur_v1.44.3
seq.error(fasta=final_otu_seqs.fasta, reference=HMP_MOCK.v35.fasta, aligned=F)

HMP-Mock-Even was sequenced with 16S V6V9 primers; I just want to check the error rate of the OTUs against the reference.

I noticed that some OTUs have weird output:
|query|reference|weight|insertions|deletions|substitutions|ambig|matches|mismatches|total|error|numparents|
|---|---|---|---|---|---|---|---|---|---|---|---|
|OTU_1008|M.smithii1|1|0|0|0|0|0|0|0|0|1|
|OTU_2330|S.pneumoniae1|1|0|2|15|0|281|17|298|0.057047|1|
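
For anyone wanting to screen a whole error summary for rows like OTU_1008, here is a minimal Python sketch. It assumes the pipe-delimited layout shown above and that `error` equals `mismatches / total`, which is inferred from the OTU_2330 row (17 / 298) rather than confirmed by the mothur documentation. The function names `parse_row` and `check_rows` are hypothetical.

```python
# Rows copied from the table above.
HEADER = ("|query|reference|weight|insertions|deletions|substitutions|"
          "ambig|matches|mismatches|total|error|numparents|")
ROWS = [
    "|OTU_1008|M.smithii1|1|0|0|0|0|0|0|0|0|1|",
    "|OTU_2330|S.pneumoniae1|1|0|2|15|0|281|17|298|0.057047|1|",
]


def parse_row(line):
    """Split one pipe-delimited row into a list of cell strings."""
    return [cell.strip() for cell in line.strip().strip("|").split("|")]


def check_rows(header, rows):
    """Label each query as consistent, inconsistent, or having no aligned positions."""
    names = parse_row(header)
    results = {}
    for line in rows:
        rec = dict(zip(names, parse_row(line)))
        total = int(rec["total"])
        mismatches = int(rec["mismatches"])
        if total == 0:
            # Nothing was compared, so an error of 0 carries no information.
            results[rec["query"]] = "no aligned positions"
        else:
            # Assumption: error = mismatches / total (inferred from OTU_2330).
            recomputed = mismatches / total
            ok = abs(recomputed - float(rec["error"])) < 1e-6
            results[rec["query"]] = "consistent" if ok else "inconsistent"
    return results


if __name__ == "__main__":
    for query, status in check_rows(HEADER, ROWS).items():
        print(query, status)
```

A row with `total` equal to 0 means no positions were compared at all, so its reported error of 0 should be treated as missing rather than as a perfect match.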

OTU_1008
TAGTCGTCGGTTAAGTCCGGCAACGAGCGCAACCCACGTCCTTAGTTGCCAGCATTCAGT
TGGGCACTCTAGGGAAACTGCCGGTGATAAGCCGGAGGAAGGTGTGGATGACGTCAAGTC
CTCATGGCCCTTACGGGTTGGGCTACACACGTGCTACAATGGCAGTGACAATGGGTTAAT
CCCAAAAAGCTGTCTCAGTTCGGATTGGGGTCTGCAACTCGACCCCATGAAGTCGGAATC
GCTAGTAATCGCGTAACAGCATGACGCGGTGAATACGTTCCCGGGCCTTGTACACACCGC
CCGTCACACCATGGGAATTGGTTCTACCCGAAGGCGGTGCGCCAACCTCGCAAGAGGAGG
CAGCCGACCACGGTAGGATCAGTGACTGGGGTGAAGT

OTU_2330
CGTTACCCTTAGGTACCTACGGCGGTACTACACACGTGCTACAATGGCTGGTACAACGAG
TCGCAAGCCGGTGACGGCAAGCTAATCTCTTAAAGCCAGTCTCAGTTCGGATTGTAGGCT
GCAACTCGCCTACATGAAGTCGGAATCGCTAGTAATCGCGGATCAGCACGCCGCGGTGAA
TACGTTCCCGGGCCTTGTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAG
TCGGTGAGGTAACCGTAAGGAGCCAGCCGCCTAAGGTGGGATAGATGATTGGGGTGAAGT

Actually, OTU_1008 can be aligned to R. sphaeroides, so I'm not sure what causes this behavior. Most OTUs have sensible output, like OTU_2330.

Thanks.

pschloss · November 16, 2020, 6:13pm

Without looking at the fasta file and your HMP_MOCK file it would be hard to say. FWIW, I’d discourage running seq.error on OTU data. It is generally only run on the fasta data prior to clustering, as shown in the MiSeq SOP. If you could forward final_otu_seqs.fasta and HMP_MOCK.v35.fasta to mothur.bugs@gmail.com with a link to this post, we can take a look.

Pat

qinglong · November 16, 2020, 9:07pm

Thanks Pat!

Just sent the two files for you to have a look.

westcott · November 18, 2020, 4:27pm

Thanks for sending your files. The seq.error code had a small bug. When OTU_1008 was aligned to the references, some references resulted in an alignment with no overlap. Mothur mistakenly reported 0 diffs between the reference and the query read in these cases, because the length of the aligned read was 0. The number of diffs should be MAX_INT when there is no overlap. I have fixed this bug and the change will be part of our next release.
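
The failure mode described above can be illustrated with a small Python sketch. This is not mothur's actual code (mothur is written in C++); it is a simplified toy model that only compares columns where both aligned sequences have a base and ignores indels. The names `count_diffs`, `best_reference`, and `NO_OVERLAP` are hypothetical, and `sys.maxsize` stands in for `MAX_INT`.

```python
import sys

GAPS = "-."
NO_OVERLAP = sys.maxsize  # stand-in for MAX_INT in this sketch


def count_diffs(query, ref, fixed=True):
    """Count mismatches over columns where both aligned sequences have a base.

    With fixed=False, a pair with no overlapping columns returns 0 diffs,
    which mimics the bug. With fixed=True it returns NO_OVERLAP instead.
    """
    overlap = 0
    diffs = 0
    for q, r in zip(query, ref):
        if q in GAPS or r in GAPS:
            continue
        overlap += 1
        if q != r:
            diffs += 1
    if overlap == 0 and fixed:
        return NO_OVERLAP
    return diffs


def best_reference(alignments, fixed=True):
    """Return the reference name with the fewest diffs.

    alignments maps a reference name to (aligned_query, aligned_reference).
    """
    return min(alignments,
               key=lambda name: count_diffs(*alignments[name], fixed=fixed))


if __name__ == "__main__":
    # Toy alignments: the first pair shares no columns, the second has one mismatch.
    alignments = {
        "ref_no_overlap": ("ACGT----", "----ACGT"),
        "ref_overlap": ("ACGTACGT", "ACGAACGT"),
    }
    print(best_reference(alignments, fixed=False))
    print(best_reference(alignments, fixed=True))
```

In this toy model, the unfixed version prefers the reference with no overlap because 0 diffs beats 1 diff, which is consistent with OTU_1008 being assigned to a reference with all-zero counts.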

qinglong · November 18, 2020, 4:39pm

Thanks Sara!

Makes sense now.
