Hello,
I have to analyse 454 FLX+ sequences and in the beginning I tried to analyse the forward and reverse sequences (half of the sequences randomly starts from forward or reverse). Now I tried to combine them and this works out until the cluster command that will stuck (I don´t get an error message).
I used:
mothur v.1.31.2
trim.seqs(fasta=…, oligos=…, minlength=500, maxlength=1050, qaverage=35, qwindowsize=50, maxhomop=8, pdiffs=0)
…
align.seqs(fasta=16S_trim.unique.fasta, reference=silva.bacteria.fasta, align=gotoh, match=1, mismatch=-3, flip=T, processors=8)
screen.seqs(fasta=16S_trim.unique.align, maxhomop=8, name=16S_trim.names, group=mergegroupspick, minlength=500, processors=8)
filter.seqs(fasta=16S_trim.unique.good.align, vertical=T)
unique.seqs(fasta=16S_trim.unique.good.filter.fasta, name=16S_trim.good.names)
pre.cluster(fasta=16S_trim.unique.good.filter.unique.fasta, name=16S_trim.unique.good.filter.names, group=mergegroupspickgood, diffs=2, processors=8)
chimera.uchime
remove.seqs
classify.seqs
remove.lineage
summary from here:
Start End NBases Ambigs Polymer NumSeqs
Minimum: 1 1634 500 0 4 1
2.5%-tile: 1 1940 520 0 4 2073
25%-tile: 1 2767 659 0 5 20728
Median: 1 3005 752 0 5 41455
75%-tile: 506 3005 826 0 6 62182
97.5%-tile: 1218 3047 877 2 7 80837
Maximum: 1506 3094 936 6 8 82909
Mean: 284.745 2827.17 735.422 0.182393 5.42115
of unique seqs: 73481
total # of seqs: 82909
dist.seqs(fasta=16S_trim.unique.good.filter.unique.precluster.pick.pick.fasta, output=lt, processors=8)
cluster(phylip=16S_trim.unique.good.filter.unique.precluster.pick.pick.phylip.dist, name=16S_trim.unique.good.filter.unique.precluster.pick.pick.names)
Hm, I think the problem is that the sequences don´t align in the same area? I also tried to use a cutoff of 0.03 but I end up with some unique sequences.
Is there any way to get this done?
Could it help when I run the trim.flows etc. with the sff files? I am trying this in the moment but I am failing… But if this would help I would ask you about this later.
It would be so great if you could help!
Thank you so much!


And it also works for the ITS sequences. That is so good!