How can I also extract archaeal sequences?

Kumari_Richa · October 9, 2014, 11:52am

Hello Mothur members.

I have amplified the V4-V5 hypervariable region of the 16S rRNA gene using primer sets specific to both Bacteria and Archaea. I follow the MiSeq SOP, where a SILVA-based bacterial reference alignment is provided. I also want to extract the archaeal sequences from my samples. Could you please suggest how I can do so?
(My OS is 64-bit Windows, and my mothur version is 1.33.3.)

Looking forward,

dwaite · October 9, 2014, 7:50pm

After you’ve classified your sequences you can extract particular lineages using the get.lineage() command.

For example,

classify.seqs(fasta=yourfile.fasta, count=yourfile.count_table, taxonomy=trainset9_032012.pds.tax, template=trainset9_032012.pds.fasta)
get.lineage(fasta=yourfile.fasta, count=yourfile.count_table, taxonomy=yourfile.pds.wang.taxonomy, taxon=Archaea)

This works with any of the classification databases; you just need to modify the taxon name accordingly. For example, in Greengenes it’s ‘k__Archaea’.
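To illustrate why the taxon label has to match the database's naming, here is a minimal Python sketch of lineage selection on a mothur-style taxonomy file (sequence ID, a tab, then semicolon-separated levels with optional confidence scores). It is a simplified illustration, not mothur's own get.lineage implementation; the function name `select_lineage` and the toy records are hypothetical.

```python
import re

def select_lineage(taxonomy_lines, taxon):
    """Return IDs of sequences whose lineage contains the given taxon name."""
    selected = []
    for line in taxonomy_lines:
        line = line.strip()
        if not line:
            continue
        seq_id, lineage = line.split("\t", 1)
        # Drop confidence scores such as "(100)" before comparing names.
        names = [re.sub(r"\(\d+\)$", "", level)
                 for level in lineage.split(";") if level]
        if taxon in names:
            selected.append(seq_id)
    return selected

# Toy records (invented for illustration): two naming styles for Archaea.
example = [
    "seq1\tBacteria(100);Firmicutes(98);",
    "seq2\tArchaea(100);Euryarchaeota(95);",
    "seq3\tk__Archaea(100);p__Crenarchaeota(90);",
]
print(select_lineage(example, "Archaea"))     # ['seq2']
print(select_lineage(example, "k__Archaea"))  # ['seq3']
```

In this sketch a label that does not match the database's spelling exactly selects nothing, which is why the taxon should be copied from the taxonomy file you actually classified against.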

Kumari_Richa · October 10, 2014, 9:57am

Thank you very much for the explanation.

If I want to align and classify Bacteria and Archaea together (in one step in mothur), what should I do? I ask because I want to see archaeal and bacterial OTUs combined in one picture.

Which reference file should I use so that I can align bacteria and archaea together?

Looking forward

pschloss · October 20, 2014, 9:24pm

You need to use something like greengenes or silva where both archaea and bacterial sequences are included.

Kumari_Richa · October 22, 2014, 8:39am

Hello Dr. Schloss,

Thanks for your suggestion.

In the SILVA and Greengenes reference files posted at http://www.mothur.org/wiki/Silva_reference_files and http://www.mothur.org/wiki/Greengenes-formatted_databases respectively, where can I find the fasta files to supply to pcr.seqs (or align.seqs) and to classify.seqs? I am sorry, but I got confused when switching from the reference files provided in the MiSeq SOP to these reference files.
I read at http://www.mothur.org/wiki/Greengenes-formatted_databases that the Greengenes reference alignment should be used (if necessary, but why?) to align sequences. You also suggest not using it for real analysis because the alignment is poor. So what can be used for alignment instead, if one is using Greengenes?
My second question: when using Greengenes, how can I find the start and end positions of my sequences that have to be supplied to the pcr.seqs command?

Looking forward,

Richa

pschloss · October 24, 2014, 7:24pm

> In the SILVA and Greengenes reference files posted at http://www.mothur.org/wiki/Silva_reference_files and http://www.mothur.org/wiki/Greengenes-formatted_databases respectively, where can I find the fasta files to supply to pcr.seqs (or align.seqs) and to classify.seqs? I am sorry, but I got confused when switching from the reference files provided in the MiSeq SOP to these reference files.

If you download the compressed files from there you’ll find fasta formatted align files that you can use.

> I read at http://www.mothur.org/wiki/Greengenes-formatted_databases that the Greengenes reference alignment should be used (if necessary, but why?) to align sequences. You also suggest not using it for real analysis because the alignment is poor. So what can be used for alignment instead, if one is using Greengenes?

Do not use greengenes for alignments. You can use it for classification. Use silva for alignments and classification if you want.

> My second question: when using Greengenes, how can I find the start and end positions of my sequences that have to be supplied to the pcr.seqs command?

Get an E. coli sequence and trim it to your primer positions.
Align it against the greengenes alignment.
Run summary.seqs on the aligned sequence.
Use the start and end coordinates it reports.
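To make the last two steps concrete, here is a small Python sketch of what the start and end coordinates of an aligned sequence mean: the alignment columns of its first and last base. It is an illustration only, not how summary.seqs is implemented. It assumes gaps are written as '-' or '.' and that columns are counted from 1; the function name `aligned_start_end` and the toy sequence are hypothetical.

```python
def aligned_start_end(aligned_seq):
    """Return 1-based alignment columns of the first and last base, or None."""
    gap_chars = "-."  # assumed gap symbols in the aligned fasta file
    positions = [i for i, ch in enumerate(aligned_seq, start=1)
                 if ch not in gap_chars]
    if not positions:
        return None  # the sequence contains only gaps
    return positions[0], positions[-1]

# Toy aligned sequence (invented): bases occupy columns 7 through 17.
toy = "....--ACGT-AC--GG...."
print(aligned_start_end(toy))  # (7, 17)
```

The two numbers correspond to what you would pass as start and end to pcr.seqs, provided they come from a sequence aligned against the same reference alignment you intend to trim.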

Kumari_Richa · October 27, 2014, 4:56pm

Hello Dr. Schloss,

I used the recreated seed database files for both alignment and classification, and I got better results. But I did not see Archaea in my files. I expect both Archaea and Bacteria in my samples because I used primer sets that target both. Kindly suggest how I should proceed.

looking forward,

Richa

pschloss · October 28, 2014, 4:03pm

Have you seen these pages?

http://blog.mothur.org/2014/08/08/SILVA-v119-reference-files/
http://www.mothur.org/wiki/Silva_reference_files

There are archaea in there.

Kumari_Richa · November 7, 2014, 12:48pm

Hello,

When I use recreated seed database reference files for classification, I get this kind of message in mothur and this step fails:

‘AB183858.UncB4157’ is in your template file and is not in your taxonomy file. Please correct.
‘HQ197980.HvnArane’ is in your template file and is not in your taxonomy file. Please correct.
‘L01575.ThyFlexu’ is in your template file and is not in your taxonomy file. Please correct.
‘AB282889.LacSimi2’ is in your template file and is not in your taxonomy file. Please correct.

I used the same silva.seed_v119.align file for alignment.

Please help and suggest.

looking forward.

pschloss · November 11, 2014, 6:07pm

Since you seem to be having problems with a lot of things across mothur, I would strongly encourage you to use files that we know work before you go off and try new things.
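For the "is in your template file and is not in your taxonomy file" messages, one possible cause is pairing a reference fasta file with a taxonomy file that was built for a different set of reference sequences. A rough Python sketch for listing the affected IDs is below. It assumes the ID is the first token of each fasta header and the first tab-separated column of the taxonomy file; the helper names, the `X00001.Example` ID, and the toy records are hypothetical.

```python
def fasta_ids(fasta_lines):
    """Collect sequence IDs (first token after '>') from fasta header lines."""
    return {line[1:].split()[0] for line in fasta_lines if line.startswith(">")}

def taxonomy_ids(taxonomy_lines):
    """Collect sequence IDs (first tab-separated column) from taxonomy lines."""
    return {line.split("\t", 1)[0].strip()
            for line in taxonomy_lines if line.strip()}

def missing_from_taxonomy(fasta_lines, taxonomy_lines):
    """Return reference IDs with no entry in the taxonomy file, sorted."""
    return sorted(fasta_ids(fasta_lines) - taxonomy_ids(taxonomy_lines))

# Toy inputs (invented): two reference IDs have no taxonomy entry.
reference = [">AB183858.UncB4157", "ACGT",
             ">L01575.ThyFlexu", "GGCC",
             ">X00001.Example", "TTAA"]
taxonomy = ["X00001.Example\tBacteria;Proteobacteria;"]
print(missing_from_taxonomy(reference, taxonomy))
# ['AB183858.UncB4157', 'L01575.ThyFlexu']
```

If the list is long, the two files probably do not belong together, and it is safer to download a matching fasta and taxonomy pair than to patch individual entries.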
