Hi,
I have used the reference file SILVA_132_LSURef_tax_silva_trunc.fasta (593 318 kb), and taxonomy file taxmap_slv_lsu_ref_132.txt (27 175 kb), to classify my sequences (processed successfully using mothur until this step), but the run stayed forever at
Using 8 processors.
Generating search database…
What could be the reason? Are the files too big? I tried to use get.lineage to pick the taxa of interest (Eukaryota-Bacteria; Cyanobateria), but the pick was only successful in the taxonomy file. I searched throughout the fasta file and there were these taxa. There was no use by deleting the “>” symbol in the fasta file either.
what platform (OS) are you using? I had that problem once with redhat and I was just calling several times the same process instead of distributing it…
Can you try to upgrade to the most recent version of mothur? Those versions are quite old at this point. A few questions…
How many lines are in SILVA_132_LSURef_tax_silva_trunc.fasta (in windows I think you can get this by running find /c /v "" SILVA_132_LSURef_tax_silva_trunc.fasta from the command line)
Hi Pat,
There are 7484196 lines in the fasta file, and 198844 lines in the taxonomy file.
The first lines in both files:
AY224383.3948.6873 Bacteria;Firmicutes;Bacilli;Bacillales;Bacillaceae;Bacillus;Bacillus cereus
GGUUAAGUUAGAAAGGGCGCACGGUGGAUGCCUUGACACUAGGAGUCGAUGAAGGACGGGACUAACGCCGAUAUGCUUCG
GGGAGCUGUAAGUAAGCUUUGAUCCGAAGAUUUCCGAAUGGGGAAACCCACCAUACGUAAUGGUAUGGUAUCCUUAUCUG
GAUUUCCGAAUGGGGAAACCCACCAUACGUAAUGGUAUGGUAUCCUUAUCUG
I have deleted the “>” in the fasta file. In the taxonomy file, the first column was originally three separate columns, with the primaryAccession column corresponds to e.g. AY224379, and start column corresponds to e.g. 2894, and stop column corresponds to e.g. 5819. I combined these columns. Because, before I did these, there was an error like AY224379.2894.5819 was in the template file but not in the taxonomy file.
Hi Pate,
I did what you suggested. Now classify.seqs works. But get.lineage(fasta=SILVA_132_LSURef_tax_silva_trunc.fasta, taxonomy=taxmap_slv_lsu_ref_132.tx, taxon=Eukaryota-Bacteria;Cyanobacteria) still worked only with the taxonomy file, but not with the fasta file.