New Format for Classify.seqs '.taxonomy' file?

Hi!

I’d like confirmation that the way lineages are reported in the ‘.taxonomy’ file has changed. Output from v.1.39.5 reports sequences that are unclassified below a certain rank as:

k__Bacteria(100);p__Proteobacteria(100);c__Betaproteobacteria(100);o__Burkholderiales(95);f__Comamonadaceae(95);f__Comamonadaceae_unclassified(95);f__Comamonadaceae_unclassified(95);
k__Bacteria(100);p__Proteobacteria(100);c__Betaproteobacteria(100);c__Betaproteobacteria_unclassified(100);c__Betaproteobacteria_unclassified(100);c__Betaproteobacteria_unclassified(100);c__Betaproteobacteria_unclassified(100);

When I last used mothur, about 1.5 years ago (v.1.31.2), the same lineages would have looked more like:

k__Bacteria(100);p__Proteobacteria(100);c__Betaproteobacteria(100);o__Burkholderiales(95);f__Comamonadaceae(95);g__unclassified(95);s__unclassified(95);
k__Bacteria(100);p__Proteobacteria(100);c__Betaproteobacteria(100);unclassified;unclassified;unclassified;unclassified;

Small question, but I looked and couldn’t find the answer.

In version 1.38, we decided to attach the parent taxon to unclassified taxa. For example, if you have a sequence like:

GQY1XT001CFHYQ Bacteria(100);Firmicutes(100);Clostridia(100);Clostridiales(100);unclassified;unclassified;unclassified;

mothur now attaches Clostridiales, the lowest classified taxon, to each unclassified level below it. Where you previously saw only an unclassified genus, you can now see at a glance the finest taxon that was assigned.

GQY1XT001CFHYQ Bacteria(100);Firmicutes(100);Clostridia(100);Clostridiales(100);Clostridiales_unclassified(100);Clostridiales_unclassified(100);Clostridiales_unclassified(100);
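
If you have older '.taxonomy' output and want it to match the new labels, the relabeling can be approximated outside mothur. The following is only a minimal sketch, not mothur's own code: the function name attach_parent is hypothetical, and it assumes that a bare "unclassified" level inherits the confidence of its parent and that rank-prefixed forms such as "g__unclassified(95)" keep their own confidence, as in the examples in this thread.

```python
import re

# Split "Name(100)" into name and optional confidence score.
_TAXON = re.compile(r"^(?P<name>.*?)(?:\((?P<conf>\d+)\))?$")
# Matches "unclassified" with an optional rank prefix such as "g__".
_UNCLASSIFIED = re.compile(r"(?:[a-z]__)?unclassified")


def attach_parent(lineage: str) -> str:
    """Relabel unclassified levels as '<parent>_unclassified(<conf>)'.

    Illustrative only; assumes a semicolon-delimited lineage string
    like the ones found in a '.taxonomy' file.
    """
    parent_name, parent_conf = None, None
    out = []
    for level in lineage.strip().rstrip(";").split(";"):
        match = _TAXON.match(level)
        name, conf = match.group("name"), match.group("conf")
        if _UNCLASSIFIED.fullmatch(name) and parent_name is not None:
            # Assumption: reuse the parent's score when none is given.
            conf = conf or parent_conf
            label = f"{parent_name}_unclassified"
            out.append(f"{label}({conf})" if conf else label)
        else:
            # Remember the lowest classified taxon seen so far.
            parent_name, parent_conf = name, conf
            out.append(level)
    return ";".join(out) + ";"


old = ("Bacteria(100);Firmicutes(100);Clostridia(100);"
       "Clostridiales(100);unclassified;unclassified;unclassified;")
print(attach_parent(old))
```

Applied to the old-style line above, this reproduces the new-style lineage shown for GQY1XT001CFHYQ.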

This change also makes the *.tax.summary file easier to read:

taxlevel rankID taxon daughterlevels total
0 0 Root 1 3772
1 0.1 Bacteria 7 3772

2 0.1.6 Firmicutes 4 1727

3 0.1.6.2 Clostridia 1 1558
4 0.1.6.2.1 Clostridiales 5 1558

5 0.1.6.2.1.2 Clostridiales_unclassified 1 244
6 0.1.6.2.1.2.1 Clostridiales_unclassified 1 244
7 0.1.6.2.1.2.1.1 Clostridiales_unclassified 0 244

Versus the previous format:

taxlevel rankID taxon daughterlevels total
0 0 Root 1 3772
1 0.1 Bacteria 7 3772

2 0.1.6 Firmicutes 4 1727

3 0.1.6.2 Clostridia 1 1558
4 0.1.6.2.1 Clostridiales 5 1558

5 0.1.6.2.1.2 unclassified 1 244
6 0.1.6.2.1.2.1 unclassified 1 244
7 0.1.6.2.1.2.1.1 unclassified 0 244
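
If you need to compare summary files produced before and after this change, one option is to collapse the new labels back to plain "unclassified" before comparing. This is a rough sketch with hypothetical helper names; it assumes whitespace-delimited rows with the taxon in the third column, as in the excerpts above, and that no real taxon in your reference ends in "_unclassified".

```python
def collapse_unclassified(taxon: str) -> str:
    """Map '<parent>_unclassified' back to the old 'unclassified' label."""
    return "unclassified" if taxon.endswith("_unclassified") else taxon


def normalize_summary_rows(rows):
    """Yield tab-joined summary rows with old-style taxon labels.

    Header and blank lines pass through with only their whitespace
    normalized, because their first field is not a numeric taxlevel.
    """
    for row in rows:
        fields = row.split()
        if len(fields) >= 3 and fields[0].isdigit():
            fields[2] = collapse_unclassified(fields[2])
        yield "\t".join(fields)


new_rows = [
    "taxlevel rankID taxon daughterlevels total",
    "4 0.1.6.2.1 Clostridiales 5 1558",
    "5 0.1.6.2.1.2 Clostridiales_unclassified 1 244",
]
for line in normalize_summary_rows(new_rows):
    print(line)
```

Note that this discards exactly the information the new format adds, so it is only useful for checking that counts agree between versions.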