Problem with mothur v 1.47 count_table is not maintaining groups

jmacnico · March 16, 2022, 10:48pm

I ran into this problem when I went to do my shared file for a phylotype based analysis.

[WARNING]: Your count file does not have group info, all reads will be assigned to mothurGroup.

I didn’t think anything of it at the time because I knew there were some new synthesis of the commands in this version. BUT when I got to my count groups prior to subsampling there was only 1 group… uhoh

I noticed that in my pre.cluster command I got this note:
When using running without group information mothur can only use 1 processor, continuing.

I went back through my count_tables from all my commands and it looks like for some reason after I run unique.seqs the first time my count_table file no longer includes my samples as groups. So the file looks something like this:

Representative_Sequence	total
M05184_45_000000000-KB33R_1_2112_21024_17034	276976
M05184_45_000000000-KB33R_1_1103_3487_14047	702
M05184_45_000000000-KB33R_1_1106_15957_20033	21465
M05184_45_000000000-KB33R_1_1102_14353_26226	954
M05184_45_000000000-KB33R_1_2113_16846_8860	50451
M05184_45_000000000-KB33R_1_1109_26590_14472	25
M05184_45_000000000-KB33R_1_2107_22239_13176	66

As opposed to what it usually does which is more like:

#Compressed Format: groupIndex,abundance. For example 1,6 would mean the read has an abundance of 6 for group HC1.
#1,HC1 2,HC100 3,HC101 4,HC102 5,HC103 6,HC104 7,HC105 8,HC106 9,HC107 10,HC108 11,HC109 12,HC11 13,HC110 14,HC111 15,HC112 16,HC113 17,HC114 18,HC115 19,HC116 20,HC117 21,HC118 22,HC119 23,HC12 24,HC120

so all that jazz BEFORE the representative sequence total info.

I also noticed that in the SOP after the first screen.seqs command the files are named:
fasta=stability.trim.contigs.fasta
contigsreport=stability.contigs_report
count=stability.contigs.count_table

Whereas mine are:
stability.trim.contigs.good.fasta
stability.trim.contigs.bad.accnos
stability.contigs.good.count_table

I don’t think there is any difference in these files other than they have the “good”, but then throughout the workflow the file names don’t really align.

Not really sure where I might have gone wrong or what I’m missing

westcott · March 17, 2022, 3:51pm

The issue is caused by not including the existing count file in commands. It starts with this command:

mothur > unique.seqs(fasta=current) - generate new count file with no groups

try this instead:

mothur > unique.seqs(fasta=current, count=current) - update existing count file to combine identical reads.

system · March 27, 2022, 3:51pm

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How do I add group info to .count_table Commands in mothur	8	523	March 11, 2022
How to add group information to a countfile? Commands in mothur	1	1134	August 9, 2016
Issue with the count table from count.seqs() mothur bugs	1	1686	September 8, 2015
Pre.cluster is very lslow and the fasta file which it produce is blank	6	542	February 20, 2024
Count.groups discrepancy? Commands in mothur	3	255	December 2, 2023

Problem with mothur v 1.47 count_table is not maintaining groups

Related topics