I downloaded the Full-length sequences and Taxonomy file from the wiki. When I unzipped it, the resulting file had no file extension, and Mothur is unable to use it for align.seqs. Do I need to follow the process in the README.md to make the .align file for this reference?
The unzipped file is ~6.9 GB, and I am working on a Windows computer, if that is useful.
How did you extract the silva.nr_v138.tgz file? Windows can be notoriously finicky when extracting tgz files. When the file is properly extracted it should look like this:
The one I downloaded was just .gz, so I used the gunzip command to extract it. On another recommendation I then extracted it again with 7-zip, and I was able to access the files that way. Any other suggestions for extracting the files in one command would be appreciated, though.
I would recommend re-downloading the file because it should be in .tgz format. If it isn’t in this format when downloaded then it isn’t properly being extracted. How did you originally download the file?
Once you have a tgz file, you will want to follow the instructions for extracting a tgz with 7-zip. I’m not sure what your terminal setup is on your computer, but on macOS and Linux you would typically use the following command to extract a tgz file:
tar -xvzf silva.nr_v138.tgz
Depending on your setup, you may or may not have the tar command.
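If you do have tar, here is a rough sketch of a one-step extraction. It also covers the case from earlier in this thread: gunzip only strips the gzip layer, so what it leaves behind is most likely a plain tar archive with no extension, which would explain the large file that align.seqs could not read. The function name extract_reference and the output folder silva_v138 are made up for illustration, and I am assuming GNU tar or bsdtar (recent Windows 10 and 11 builds ship a tar command as well), both of which detect gzip compression on their own when reading from a file.

```shell
#!/bin/sh
# Hypothetical helper (not part of mothur): unpack a reference archive
# into a destination folder in one step.
# It should work whether the file is still gzip-compressed (.tgz) or has
# already been run through gunzip and is now a plain tar archive.
extract_reference() {
    archive="$1"        # e.g. silva.nr_v138.tgz
    dest="${2:-.}"      # destination folder, defaults to the current one
    mkdir -p "$dest" || return 1
    # -x extract, -f read from this file, -C unpack into $dest.
    # No -z flag: GNU tar and bsdtar detect the compression themselves.
    tar -xf "$archive" -C "$dest"
}

# Example call (assumed file and folder names):
# extract_reference silva.nr_v138.tgz silva_v138
```

After it finishes, point align.seqs at the extracted .align file inside the destination folder rather than at the archive itself.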