I’m trying to improve on my pipeline for analyzing full-length 16S sequences from PacBio CCS, but after some recent changes I’ve started to get segmentation faults at the Clustering step and I’m not sure why…
“”"
mothur > align.seqs(fasta=Chun_0010.filter.fasta, flip=t, reference=/home/UNIXHOME/bbowman/data/references/16S/silva.both.align, processors=16)
mothur > summary.seqs(fasta=Chun_0010.filter.align)
mothur > screen.seqs(start=5251, fasta=Chun_0010.filter.align, end=38908, processors=16)
mothur > filter.seqs(fasta=Chun_0010.filter.good.align, vertical=T, processors=16)
mothur > unique.seqs(fasta=Chun_0010.filter.good.filter.fasta)
mothur > pre.cluster(diffs=4, fasta=Chun_0010.filter.good.filter.unique.fasta, name=Chun_0010.filter.good.filter.names)
mothur > chimera.uchime(fasta=Chun_0010.filter.good.filter.unique.precluster.fasta, processors=16, reference=/home/UNIXHOME/bbowman/data/references/16S/silva.gold.align)
mothur > remove.seqs(fasta=Chun_0010.filter.good.filter.unique.precluster.fasta, accnos=Chun_0010.filter.good.filter.unique.precluster.uchime.accnos)
mothur > dist.seqs(output=lt, fasta=Chun_0010.filter.good.filter.unique.precluster.pick.fasta, calc=onegap, processors=16, countends=F)
mothur > cluster(phylip=Chun_0010.filter.good.filter.unique.precluster.pick.phylip.dist, name=Chun_0010.filter.good.filter.unique.precluster.names, method=average)
“”"
The process reaches the clustering step consistently, but then seg-faults out during the reading in of the matrix. I’ve repeated this process on both the default version of Mothur I had installed (v1.30) and with the most recent stable build (v1.33.2) and observed the same behavior.
I can send the raw-data file and my logfiles if needed (fasta is ~3MB zipped)
-Brett