I’m following the Costello Analysis using my own data.
After trim.seqs(fasta=TDPool1.raw.fasta, oligos=TDPool1Barcodes.oligos2, qfile=TDPool1.raw.qual, maxambig=0, maxhomop=8, flip=T, bdiffs=1, pdiffs=2, qwindowaverage=35, qwindowsize=50) I get:
Start End NBases Ambigs Polymer
Minimum: 1 28 28 0 2
2.5%-tile: 1 29 29 0 3
25%-tile: 1 249 249 0 4
Median: 1 349 349 0 4
75%-tile: 1 420 420 0 5
97.5%-tile: 1 450 450 0 6
Maximum: 1 529 529 0 8
of Seqs: 703350
After unique.seqs(fasta=TDPool1.raw.trim.fasta) I get:
Start End NBases Ambigs Polymer
Minimum: 1 28 28 0 2
2.5%-tile: 1 40 40 0 4
25%-tile: 1 299 299 0 4
Median: 1 369 369 0 4
75%-tile: 1 429 429 0 5
97.5%-tile: 1 450 450 0 6
Maximum: 1 529 529 0 8
of Seqs: 200575
After align.seqs(candidate=TDPool1.raw.trim.unique.fasta, template=…/Mothur.source/silva.bacteria.fasta, flip=T) I get:
Some of you sequences generated alignments that eliminated too many bases, a list is provided in TDPool1.raw.trim.unique.flip.accnos. If the reverse compliment proved to be better it was reported.
It took 5067 secs to align 200575 sequences.
Start End NBases Ambigs Polymer
Minimum: 26790 28464 79 0 8
2.5%-tile: 26790 28464 79 0 8
25%-tile: 26790 28464 79 0 8
Median: 26790 28464 79 0 8
75%-tile: 26790 28464 79 0 8
97.5%-tile: 26790 28464 79 0 8
Maximum: 26790 28464 79 0 8
of Seqs: 1
How did I end up with one sequence?
I’d appreciate any help pinpointing what the heck I’m doing wrong Thank you!