Problem with mothur v 1.47 count_table is not maintaining groups

I ran into this problem when I went to do my shared file for a phylotype based analysis.

[WARNING]: Your count file does not have group info, all reads will be assigned to mothurGroup.

I didn’t think anything of it at the time because I knew there were some new synthesis of the commands in this version. BUT when I got to my count groups prior to subsampling there was only 1 group… uhoh

I noticed that in my pre.cluster command I got this note:
When using running without group information mothur can only use 1 processor, continuing.

I went back through my count_tables from all my commands and it looks like for some reason after I run unique.seqs the first time my count_table file no longer includes my samples as groups. So the file looks something like this:

Representative_Sequence total
M05184_45_000000000-KB33R_1_2112_21024_17034 276976
M05184_45_000000000-KB33R_1_1103_3487_14047 702
M05184_45_000000000-KB33R_1_1106_15957_20033 21465
M05184_45_000000000-KB33R_1_1102_14353_26226 954
M05184_45_000000000-KB33R_1_2113_16846_8860 50451
M05184_45_000000000-KB33R_1_1109_26590_14472 25
M05184_45_000000000-KB33R_1_2107_22239_13176 66

As opposed to what it usually does which is more like:

#Compressed Format: groupIndex,abundance. For example 1,6 would mean the read has an abundance of 6 for group HC1.
#1,HC1 2,HC100 3,HC101 4,HC102 5,HC103 6,HC104 7,HC105 8,HC106 9,HC107 10,HC108 11,HC109 12,HC11 13,HC110 14,HC111 15,HC112 16,HC113 17,HC114 18,HC115 19,HC116 20,HC117 21,HC118 22,HC119 23,HC12 24,HC120

so all that jazz BEFORE the representative sequence total info.

I also noticed that in the SOP after the first screen.seqs command the files are named:
fasta=stability.trim.contigs.fasta
contigsreport=stability.contigs_report
count=stability.contigs.count_table

Whereas mine are:
stability.trim.contigs.good.fasta
stability.trim.contigs.bad.accnos
stability.contigs.good.count_table

I don’t think there is any difference in these files other than they have the “good”, but then throughout the workflow the file names don’t really align.

Not really sure where I might have gone wrong or what I’m missing :confused: :confused:

The issue is caused by not including the existing count file in commands. It starts with this command:

mothur > unique.seqs(fasta=current) - generate new count file with no groups

try this instead:

mothur > unique.seqs(fasta=current, count=current) - update existing count file to combine identical reads.

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.