The RDP database has two files: one is a fasta file, the other is an id-to-tax file, and I can use them directly. The Silva database, however, is confusing. The "Full length sequences and taxonomy references" package doesn’t seem to have a fasta file or an id-to-tax file like the RDP database does.
Do you know where I can download a ready-to-use Silva database (like the RDP one)?
The SILVA file is compressed twice. When you decompress the *.gz file that you downloaded, you get another file, which you have to decompress a second time. That gives you the *.tax and *.fasta files you need. I got confused by this as well.
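If you would rather script the two steps, here is a minimal Python sketch. It assumes the download is a gzip-compressed tar archive (which is what "compressed twice" usually means, but I have not checked every release), and the function name `extract_reference_files` is made up for this example.

```python
import shutil
import tarfile
from pathlib import Path


def extract_reference_files(archive_path, out_dir):
    """Pull the .tax and .fasta files out of a compressed tar archive.

    Assumption: the download is a tar archive wrapped in gzip.
    Returns the sorted list of paths that were written to out_dir.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    extracted = []
    # "r:*" lets tarfile detect the compression itself, so the gzip
    # layer and the tar layer are both handled in a single pass.
    with tarfile.open(archive_path, mode="r:*") as archive:
        for member in archive.getmembers():
            if member.isfile() and member.name.endswith((".tax", ".fasta")):
                # Keep only the base file name and drop any folders
                # stored inside the archive.
                target = out_dir / Path(member.name).name
                with archive.extractfile(member) as src, open(target, "wb") as dst:
                    shutil.copyfileobj(src, dst)
                extracted.append(target)
    return sorted(extracted)
```

You would call it as `extract_reference_files("your_download.tgz", "silva_refs")`, where both names are placeholders for whatever you downloaded and wherever you want the files to go.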
Also, just to be clear: you don’t have to run the code in the blog post. That is there for transparency and for those who might want to tweak what we did. The actual files are provided at the wiki link above.
Open your archive software first (WinZip, WinRAR or whatever you use), then open and unzip the file from there. The file doesn’t need an extension for that. At least 7-Zip, which I use, has no problems with it.