I’d like to generate otu matrices of specific phyla. How could I do this? I’ve classified the otu’s so have the taxonomy file and the shared file.
at this poitn the easiest way would be to run get.linage for the groups you want, dist.seqs, cluter, make.shared
oh bummer. a few of my phyla are huge (100k preclustered seqs) redoing the distance matrix and clustering on that phyla, which naturally would be my first target, would take a week or so.
Thanks for the feature request. In version 1.31.0 you will be able to do the following:
mothur > classify.otu(list=final.an.list, taxonomy=final.taxonomy, name=final.names) - classify OTUs
mothur > get.lineage(constaxonomy=final.an.0.03.cons.taxonomy, shared=final.an.shared, label=0.03, taxon=‘Bacteria;Firmicutes;’) - select OTUs that classified to Bacteria;Firmicutes;.
Output File Names:
final.an.0.03.cons.pick.taxonomy
final.an.0.03.pick.shared
or
mothur > classify.otu(list=final.an.list, taxonomy=final.taxonomy, name=final.names) - classify OTUs
mothur > get.lineage(constaxonomy=final.an.0.03.cons.taxonomy, list=final.an.list, label=0.03, taxon=‘Bacteria;Firmicutes;’) - select OTUs that classified to Bacteria;Firmicutes;.
Output File Names:
final.an.0.03.cons.pick.taxonomy
final.an.0.03.pick.list
sweet, thanks Sarah
Hi, very nice - works fine.
Can I also extract a fasta file?
Thanks
To get the fasta file containing the sequences in the final.an.0.03.pick.list file, try running:
list.seqs(list=final.an.0.03.pick.list)
get.seqs(fasta=yourFastaFile, name=yourNameFile)
You may also want to include your group file:
list.seqs(list=final.an.0.03.pick.list)
get.seqs(fasta=yourFastaFile, name=yourNameFile, group=yourGroupFile)
works just fine! Thanks a lot!