Hello!
I am new to analysis and am using Mothur with paired-end MiSeq 16S V4 data for the first time. I was able to merge my forward and reverse reads which was summarized:
Start End NBases Ambigs Polymer NumSeqs
Minimum: 1 294 294 0 3 1
2.5%-tile: 1 404 404 0 4 706035
25%-tile: 1 405 405 0 4 7060343
Median: 1 410 410 1 5 14120686
75%-tile: 1 424 424 3 5 21181028
97.5%-tile: 1 429 429 14 7 27535336
Maximum: 1 602 602 297 301 28241370
Mean: 1 415 415 2 4
# of Seqs: 28241370
I then attempted to remove ambiguous base calls and problematic sequences as recommended in the code where maxambig=0 and maxlength=275, but this eliminated all of my sequences:
It took 166 secs to screen 28241370 sequences, removed 28241370.
Could anyone possibly point out where I might be going wrong and/or how I can visualize the quality scores to determine the correct cutoff point for this data?
Thank you!
Lindsay