Hi, can somebody help me with this?. After running chimera.vsearch and remove.seqs I tried running the command summary.seqs following the MiSeq SOP and keep getting an error. How do I fix this? Thanks
Using stability.trim.contigs.good.unique.good.filter.unique.precluster.count_table as input file for the count parameter.
Using stability.trim.contigs.good.unique.good.filter.unique.precluster.pick.fasta as input file for the fasta parameter.
Using 4 processors.
[ERROR]: Your count file contains 151475 unique sequences, but your fasta file contains 120068. File mismatch detected, quitting command.
I think you are getting the error because you are not using the count_table file that was generated after the “chimera.vsearch” step.
For summary in this case, you should take chimera search and removal steps altogether. I think you should use the fasta file that was generated after the “remove.seqs” step and the count_table file that was generated after the “chimera.vsearch” step.
sorry… but I was wrong, I don’t have count_table after running chimera.vsearch just those files: Output File Names:
stability.trim.contigs.good.unique.good.filter.unique.precluster.denovo.vsearch.chimeras
stability.trim.contigs.good.unique.good.filter.unique.precluster.denovo.vsearch.accnos
Look, first try to understand what these “chimera.vsearch” and “remove.seqs” commands do.
“chimera.vsearch” doesn’t remove any chimeric sequences from the fasta file but it removes from the count_table file. And produces an accnos file which contains all the removed chimeric sequences only. Now, in the next step you will use this accnos file together with the fasta file so that the fasta file can understand which sequences have to be removed.
so, you have to consider these two steps altogether as an unit.
Let me know if you understand.