Hello,
I would like to use the perseus algorithm to check a dataset for chimeras, but am getting a recurring error:
is in your fasta file and not in your namefile, please correct.
More specifically, the first sequence in all lines of the name file is read correctly, but any additional sequences linked to that representative sequences (ie. anything after a comma) is ignored. I assume this is a product of the fact that my name file was not generated within mothur. The file was generated manually, as I am using other pipelines more suitable to my data for most of my analysis. However, they don’t have chimera-checking options, so I was hoping to use Mothur to run Perseus. The name file is correctly formatted according to the information on the mothur wiki as far as I can tell - representative sequence in first column, all the sequences it represents separated by commas in the second column. An example of both input files is below. Any insight on the problem would be greatly appreciated!
-Marie
Sample Name File:
21251222902 21251222902,2121110114
2121110232 2121110232,232508291
Sample fasta:
21251222902
GAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCACCTTGCGCCCTTTGGTATTCCGAAGGGCATGCCTGTTTGAGTGTCATTAAATTCTCAACCTTGCTCGCCTTTACCGGCTTGAGTGAGGCTTGGACGTGAGGGCTTTGCTGGCTTCCTTAAGTGGATGGTCTGCTCCCTTTAAATGCATTAGTGGGATCTCTTGTGGACCGTCACTTGGTGTGATAATTATCTACGCCTCGTCGTACTTTGAAGACAAACTTATGGGAACCTGCTTATAACCGTCTCGACGAAGGGACTAACTTTCTGACTATTTGACCTACAAATCAGGTACGGACCTACCCGCTA
2121110114
GAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCACCTTGCGCCCTTTGGTATTCCGAAGGGCATGCCTGTTTGAGTGTCATTAAATTCTCAACCTTGCTCGCCTTTACCGGCTTGAGTGAGGCTTGGACGTGAGGGCTTTGCTGGCTTCCTTAAGTGGATGGTCTGCTCCCTTTAAATGCATTAGTGGGATCTCTTGTGGACCGTCACTTGGTGTGATAATTATCTACGCCTCGTCGTACTTTGAAGACAAACTTATGGGAACCTGCTTATAACCGTCTCGACGAAGGGACTAACTTTCTGACTATTTGACCTACAAATCAGGTACGGACCTACCCGCTA
2121110232
GAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCACCTTGCGCCCTTTGGTATTCCGAAGGGCATGCCTGTTTGAGTGTCATTAAATTCTCAACCTTGCTCGCCTTTACCGGCTTGAGTGAGGCTTGGACGTGAGGGCTTTGCTGGCTTCCTTAAGTGGATGGTCTGCTCCCTTTAAATGCATTAGTGGGATCTCTTGTGGACCGTCACTTGGTGTGATAATTATCTACGCCTCGTCGTACTTTGAAGACAAACTTATGGGAACCTGCTTATAACCGTCTCGACGAAGGGACTAACTTTCTGACTATTTGACCTACAAATCAGGTACGGACCTACCCGCTA
232508291
GAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCACCTTGCGCCCTTTGGTATTCCGAAGGGCATGCCTGTTTGAGTGTCATTAAATTCTCAACCTTGCTCGCCTTTACCGGCTTGAGTGAGGCTTGGACGTGAGGGCTTTGCTGGCTTCCTTAAGTGGATGGTCTGCTCCCTTTAAATGCATTAGTGGGATCTCTTGTGGACCGTCACTTGGTGTGATAATTATCTACGCCTCGTCGTACTTTGAAGACAAACTTATGGGAACCTGCTTATAACCGTCTCGACGAAGGGACTAACTTTCTGACTATTTGACCTACAAATCAGGTACGGACCTACCCGCTA