You might be interested in the create.database command, Redirecting…. It provides output like:
OTUNumber Abundance repSeqName repSeq OTUConTaxonomy
1 6307 GQY1XT001C296C A-GC–GA-G-A-A-G-T-A … GT-GAA Bacteria(100);“Bacteroidetes”(100);“Bacteroidia”(100);…
2 5124 GQY1XT001A3TJI G-GC–GA-G-A-A-G-T-A … GT-GAA Bacteria(100);“Bacteroidetes”(100);“Bacteroidia”(100);…
3 3177 GQY1XT001CS2B8 G-GC–GA-G-A-A-G-T-A … GT-GAA Bacteria(100);“Bacteroidetes”(100);“Bacteroidia”(100);…
4 2947 GQY1XT001CD9IB G-GC–GA-G-A-A-G-T-A … GT-GAA Bacteria(100);“Bacteroidetes”(100);“Bacteroidia”(100);…
…
You can also get the abundances broken down by group.
If I do not include a groups or names file for the get.oturep command will this choose a representative sequence that is equidistant from all other sequences in that OTU from all of my groups?
Mothur finds the representative the same way regardless of whether you provide a group file or not. The method parameter allows you to select the method of selecting the representative sequence. Choices are distance and abundance. The distance method finds the sequence with the smallest maximum distance to the other sequences. If tie occurs the sequence with smallest average distance is selected. The abundance method chooses the most abundant sequence in the OTU as the representative. If you provide a group file and do not select groups, mothur will include a list of groups found in the OTU in the rep fasta file. For example: get.oturep(fasta=final.fasta, name=final.names, group=final.groups, list=final.an.list, column=final.dist, label=0.03)
GQY1XT001C296C 1|6104|F003D000-F003D002-F003D004-F003D006-F003D008-F003D142-F003D144-F003D146-F003D148-F003D150
A-G-T-G-A-GC–GA-G-A-AG-T-A–TG-C-GG-A-ATG-C-G-T-G-GT-GT-A-G-CGGT-G-AAA–TG-C-AT-AG–AT-ATC-A-C-G…
If you select groups, mothur will create a rep fasta and rep names file for each group you select. For example: get.oturep(fasta=final.fasta, name=final.names, group=final.groups, list=final.an.list, column=final.dist, label=0.03, groups=all)
GQY1XT001BQRGU 1|409|F003D000
A-G-T-G-G-GC–GA-G-A-AG-T-A–TG-C-GG-A-ATG-C-G-T-G-GT-GT-A-G-CGGT-G-AAA–TG-C-AT-AG–AT-ATC-…
Will the order of output of the representative OTUs in the fasta file be the same as in the rank-abundance file from the make.shared command?
In the fasta file the sequences are as follows: seqName otuNumber|abundance|groups. So in the example above the 1 refers to OTU1. As long as you used the same list file to make the shared file, you can compare the files.