I ran into this problem when I went to do my shared file for a phylotype based analysis.
[WARNING]: Your count file does not have group info, all reads will be assigned to mothurGroup.
I didn’t think anything of it at the time because I knew there were some new synthesis of the commands in this version. BUT when I got to my count groups prior to subsampling there was only 1 group… uhoh
I noticed that in my pre.cluster command I got this note:
When using running without group information mothur can only use 1 processor, continuing.
I went back through my count_tables from all my commands and it looks like for some reason after I run unique.seqs the first time my count_table file no longer includes my samples as groups. So the file looks something like this:
As opposed to what it usually does which is more like:
#Compressed Format: groupIndex,abundance. For example 1,6 would mean the read has an abundance of 6 for group HC1.
#1,HC1 2,HC100 3,HC101 4,HC102 5,HC103 6,HC104 7,HC105 8,HC106 9,HC107 10,HC108 11,HC109 12,HC11 13,HC110 14,HC111 15,HC112 16,HC113 17,HC114 18,HC115 19,HC116 20,HC117 21,HC118 22,HC119 23,HC12 24,HC120
so all that jazz BEFORE the representative sequence total info.
I also noticed that in the SOP after the first screen.seqs command the files are named:
Whereas mine are:
I don’t think there is any difference in these files other than they have the “good”, but then throughout the workflow the file names don’t really align.
Not really sure where I might have gone wrong or what I’m missing