Hello,
I’m trying to split my sequences into two groups (from two experiments) using the remove.groups command. Here’s my input:
remove.groups(fasta=L3.final.fasta, group=L3.final.groups, name=L3.final.names, groups=Sample01-Sample02-Sample03-Sample04-Sample05-Sample06-Sample07-Sample08-Sample09-Sample10-Sample11-Sample12-Sample13-Sample14-Sample15-Sample16-Sample17-Sample18-Sample19-Sample20-Sample21-Sample22)
And mothur’s output:
Removed 32037317 sequences from your name file.
Removed 199005 sequences from your fasta file.
Removed 32037317 sequences from your group file.
Running summary.seqs and count.groups, however, reveals a problem:
# of unique seqs: 64513
total # of seqs: 8378246
mothur > count.groups(group=L3.final.pick.groups)
Sample70 contains 390385.
Sample71 contains 153851.
Sample72 contains 404284.
Sample73 contains 505864.
Sample74 contains 2379091.
Sample75 contains 533678.
Sample76 contains 1120613.
Sample77 contains 1393437.
Sample78 contains 646127.
Sample79 contains 2775750.
Sample80 contains 723047.
Adding up the counts from count.groups gives 11026127, about 2.6 million more sequences than summary.seqs reports for the names file (8378246). Furthermore, the per-group counts from count.groups match the counts for these groups before running remove.groups, so the problem seems to be in the names file produced by remove.groups: it looks like quite a few sequences that should have been kept were removed. If I run remove.groups and remove samples 70-80 instead, the same thing happens: the groups file has the right number of sequences, but the names file has too few. (If I remove both and run summary.seqs on the two separated groups and the original fasta, I get the right number of uniques, so the fasta seems to be okay.) Before this step, the number of sequences in the names file matches the total number of sequences in the groups file, so everything seems fine until I run remove.groups.
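One way to narrow this down outside mothur is sketched below in Python. It first redoes the arithmetic using the numbers from this post, then cross-checks a names file against a groups file to list exactly which sequences are missing from one or the other. This is only a rough sketch: it assumes the usual two-column layouts (each names line is a representative name followed by a comma-separated list of the names it stands for; each groups line is a sequence name followed by its group), and the function names and toy data are made up for illustration.

```python
from collections import Counter

# Per-sample counts copied from the count.groups output in this post.
group_counts = [390385, 153851, 404284, 505864, 2379091, 533678,
                1120613, 1393437, 646127, 2775750, 723047]
summary_total = 8378246  # "total # of seqs" reported by summary.seqs

groups_sum = sum(group_counts)
print(groups_sum)                  # 11026127
print(groups_sum - summary_total)  # 2647881


def parse_names(lines):
    """Map each representative name to the list of names it stands for."""
    reps = {}
    for line in lines:
        if not line.strip():
            continue
        rep, members = line.split()
        reps[rep] = members.split(",")
    return reps


def parse_groups(lines):
    """Map each sequence name to its group."""
    seq_to_group = {}
    for line in lines:
        if not line.strip():
            continue
        seq, group = line.split()
        seq_to_group[seq] = group
    return seq_to_group


def cross_check(names, groups):
    """Compare the sequences listed in a names file with those in a groups file."""
    in_names = {m for members in names.values() for m in members}
    in_groups = set(groups)
    return {
        "names_total": len(in_names),
        "groups_total": len(in_groups),
        "missing_from_names": in_groups - in_names,
        "missing_from_groups": in_names - in_groups,
        "per_group": Counter(groups.values()),
    }


# Toy data (hypothetical): sequence "e" is in the groups file but not the names file.
toy_names = ["a\ta,b,c", "d\td"]
toy_groups = ["a\tS1", "b\tS1", "c\tS2", "d\tS2", "e\tS2"]
report = cross_check(parse_names(toy_names), parse_groups(toy_groups))
print(report["names_total"], report["groups_total"])  # 4 5
print(report["missing_from_names"])                   # {'e'}
```

On the real data, the same functions could be fed open file handles for L3.final.pick.groups and the matching names file from the same remove.groups run. With millions of sequences the sets will need a fair amount of memory.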
I’ve also tried running get.groups instead, and I get the same results. Any idea what could be causing this, or how to fix it? Thank you very much!