mothur

Silva 132 database problem

#1

Hi, I have downloaded Silva 132 from here (https://www.mothur.org/wiki/Silva_reference_files). I also downloaded the RDP database (https://www.mothur.org/wiki/RDP_reference_files).

The RDP database has two files: a FASTA file and an ID-to-taxonomy file, so I can use it directly. The Silva database, however, is confusing. The "Full length sequences and taxonomy references" package doesn’t seem to contain a FASTA file or an ID-to-taxonomy file like the RDP database does.

Do you know where I can download a ready-to-use Silva database (like the RDP one)?

Thanks

#2

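The SILVA reference file archive gives you silva.nr_v132.tax and silva.nr_v132.align. The .align file is a FASTA-formatted file of aligned sequences, and it is the one you want to use as the reference when running classify.seqs; the .tax file is the matching taxonomy file.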
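For anyone scripting this, here is a minimal Python sketch that builds the corresponding classify.seqs command string. It assumes a mothur version that accepts the `reference=` parameter (older releases may use a different parameter name for the reference file), and `my_sequences.fasta` and the helper name `classify_seqs_command` are hypothetical placeholders.

```python
def classify_seqs_command(fasta,
                          reference="silva.nr_v132.align",
                          taxonomy="silva.nr_v132.tax"):
    """Build a mothur command string that classifies `fasta` against the SILVA files.

    Assumption: the installed mothur version accepts `reference=`.
    """
    return (
        f"classify.seqs(fasta={fasta}, "
        f"reference={reference}, taxonomy={taxonomy})"
    )


# Hypothetical input file name, used only for illustration.
command = classify_seqs_command("my_sequences.fasta")
print(command)
```

The printed line can be pasted at the mothur prompt or placed in a batch file. The .align file goes where the RDP FASTA file would otherwise go, and the .tax file replaces the RDP taxonomy file.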

Pat

#3

The SILVA file is compressed twice. When you decompress the *.gz file that you downloaded, you get another archive that you have to unpack a second time. This then gives you the *.tax and *.align files that you need (the *.align file is in FASTA format). I got confused by this as well.
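If the inner file is a tar archive (an assumption, since the thread does not name the format), both steps can be done in one go. The sketch below uses Python's standard `tarfile` module; the archive name `silva.nr_v132.tgz` and the helper name are hypothetical, so adjust them to whatever you actually downloaded.

```python
import tarfile
from pathlib import Path


def extract_reference_archive(archive_path, dest_dir="."):
    """Unpack a (possibly gzip-compressed) tar archive.

    Returns the names of the regular files found in the archive.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    # Mode "r:*" lets tarfile detect the compression itself, so the
    # gunzip step and the untar step happen together.
    with tarfile.open(archive_path, "r:*") as archive:
        names = [member.name for member in archive.getmembers() if member.isfile()]
        archive.extractall(dest)
    return names


# Hypothetical file name; nothing happens if the file is not present.
archive = Path("silva.nr_v132.tgz")
if archive.exists():
    print(extract_reference_archive(archive))
```

After extraction, the returned list should include the *.align and *.tax files mentioned above.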

#4

In case you didn’t know (it isn’t fully clear from your post), there is a detailed description of how the SILVA files are prepared for use with mothur:
http://blog.mothur.org/2018/01/10/SILVA-v132-reference-files/
And an overview of previous versions with download links:
https://www.mothur.org/wiki/Silva_reference_files

#5

Also, just to be clear - you don’t have to run the code in the blog post - that is for transparency and for those that might want to tweak what we did. The actual files are provided at the wiki link from above.