Hi all,
I am running a 16S rRNA sequencing run of a mock community DNA sample (from Zymo - should be 8 bacterial members) through the Mothur pipeline. I am following the MiSeq SOP online and notes that I have from a one-day workshop Pat taught at EDAMAME. Anyways, things were going pretty smoothly until I reached the chimera.vsearch step. The output should include a count table (for which I can include in the classify.seqs step) but it does not give me one in the output. Below is my script and results from using chimera.vsearch and further steps. It seemed like this step worked, but it just didn’t give a count table…
From the chimera.vsearch step, I still attempted to run classify.seqs with the current count table (looks like it used from pre.cluster step) and then remove lineage, and run summary.tax() but there are no sequences. Even when I run summary.tax() specifying the file that comes out of classify.seqs (before removing lineage) it gives back the message, 0 seconds 0 sequences. I’m not exactly sure what is going on here. I’m assuming something is not lining up right from the chimera.vsearch step on. But maybe it’s something else? As a note, I’ve already run this mock sample/sequencing data through QIIME and Uparse pipelines already and everything seemed OK, I don’t think there’s anything technically wrong with it, at least that I can detect.
Any help you could provide would be much appreciated!
Thank you in advance,
Vanessa
Chimera.vsearch: chimera.vsearch(fasta=PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.fasta, count=PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.count_table, dereplicate=t)
Using 1 processors.
/opt/mothur/1.39.0/mothurvsearch file does not exist. Checking path…
Found vsearch in your path, using /home/resgoodman/lab/tools/vsearch/1.11.1/bin/vsearch
Checking sequences from PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.fasta …
vsearch v1.11.1_linux_x86_64, 1514.0GB RAM, 48 cores
Reading file PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.temp 100%
6050 nt in 24 seqs, min 251, max 254, avg 252
Masking 100%
Sorting by abundance 100%
Counting unique k-mers 100%
Detecting chimeras 100%
Found 4 (16.7%) chimeras, 20 (83.3%) non-chimeras,
and 0 (0.0%) borderline sequences in 24 unique sequences.
Taking abundance information into account, this corresponds to
15 (0.1%) chimeras, 16979 (99.9%) non-chimeras,
and 0 (0.0%) borderline sequences in 16994 total sequences.
It took 0 secs to check 24 sequences. 4 chimeras were found.
Output File Names:
PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.denovo.vsearch.chimeras
PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.denovo.vsearch.accnos
Removed chimeras:
remove.seqs(fasta=PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.fasta, accnos=PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.denovo.vsearch.accnos)
Removed 4 sequences from your fasta file.
Output File Names:
PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.fasta
classify.seqs(fasta=current, count=current, reference=reference/trainset9_032012.pds.fasta, taxonomy=reference/trainset9_032012.pds.tax, cutoff=80)classify.seqs(fasta=current, count=current, reference=reference/trainset9_032012.pds.fasta, taxonomy=reference/trainset9_032012.pds.tax, cutoff=80) Using PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pick.count_table as input file for the count parameter. Using PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.fasta as input file for the fasta parameter.
Using 1 processors.
Reading template taxonomy… DONE.
Reading template probabilities… DONE.
It took 5 seconds get probabilities.
Classifying sequences from PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.fasta …
Processing sequence: 20
It took 0 secs to classify 20 sequences.
It took 0 secs to create the summary file for 20 sequences.
Output File Names: PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pds.wang.taxonomy PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pds.wang.tax.summary
Remove lineage:
remove.lineage(fasta=current, count=current, taxonomy=PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pds.wang.taxonomy, taxon=Chloroplast-Mitochondria-unknown-Archaea-Eukaryota)
Using PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pick.count_table as input file for the count parameter.
Using PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.fasta as input file for the fasta parameter.
[NOTE]: The count file should contain only unique names, so mothur assumes your fasta, list and taxonomy files also contain only uniques.
Output File Names: PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pds.wang.pick.taxonomy PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pick.fasta PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pick.pick.count_table
summary.tax() Using PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pds.wang.pick.taxonomy as input file for the taxonomy parameter.
It took 0 secs to create the summary file for 0 sequences.
Output File Names: PositiveControl/Positive.trim.contigs.good.unique.good.filter.unique.precluster.pick.pds.wang.pick.tax.summary