pcr.seqs

Hi. I am brand new to mothur (taking the workshop in August) and brand new to a project in my lab that has been sitting and waiting for someone to pick up. Thus…I have a very general question. Is there an easy, straight-forward way of determining the start and end point for my sequences of interest?

Thanks

Looking forward to meeting you! Try this…

  1. Take ecoli’s 16s and trim it to the region being amplified by your primers
  2. Align the trimmed sequence to silva.bacteria.fasta
  3. Run the *align file through summary.seqs - take note of the start and end positions

THANKS!

Unfortunately though I’ve run into another problem. Now when I run my pcr.seqs command everything looks like it’s going fine until its gets about a fourth in and I get these errors:

[ERROR]: 26590.temp is blank. Please correct.
[ERROR]: 26590.temp is blank. Please correct.
[ERROR]: name mismatch in pcr.seqs.
[ERROR]: name mismatch in pcr.seqs.

The second error (name mismatch in pcr.seqs) occurs about a million times (exaggerating a little).

Thoughts???

Thanks! Looking forward to meeting you in a month as well!

Darcy

Are you doing this in windows, mac, or linux? How many processors? Can you try reducing the number of processors?

I’m doing this on a Mac with 1 processor.

What’s the exact syntax of the command you’re trying to run?

I’ve tried it two ways:

  1. with batch file as follows
    make.contigs(file=stability.files, processors=1)
    summary.seqs(fasta=stability.trim.contigs.fasta)
    screen.seqs(fasta=stability.trim.contigs.fasta, group=stability.contigs.groups, maxambig=0, maxlength=275)
    summary.seqs(fasta=current)
    get.current()
    unique.seqs(fasta=stability.trim.contigs.good.fasta)
    count.seqs(name=stability.trim.contigs.good.names, group=stability.contigs.good.groups)
    summary.seqs(count=stability.trim.contigs.good.count_table)
    pcr.seqs(fasta=silva.bacteria.fasta, start=13862, end=23444, keepdots=F, processors=1)
    system(mv silva.bacteria.pcr.fasta silva.v4.fasta)
    summary.seqs(fasta=silva.v4.fasta)
    align.seqs(fasta=stability.trim.contigs.good.unique.fasta, reference=silva.v4.fasta)
    summary.seqs(fasta=stability.trim.contigs.good.unique.align, count=stability.trim.contigs.good.count_table)
    get.current()

  2. separately with each command as an individual

You said the command that was giving you problems was this:

pcr.seqs(fasta=silva.bacteria.fasta, start=13862, end=23444, keepdots=F, processors=1)


Right? I just ran that with no issue. Am I missing something?

Right. I’m sure the command itself is fine but I’m still getting the above errors when using my data set and I don’t know where to look to attempt to fix the errors. I don’t really understand the errors themselves. Can you explain the nature of the error and perhaps I can go from there? Thanks.

I think the only way you can get those errors is if you are using multiple processors, so I’m really not sure what’s going on. What version of mothur are you using?

@pschloss referring to the first question of this trend, is it also a reliable method to use the coordinates you give in SOP (for V4) and here (for V3) if, for example I know I have done my seqeuncing for V3 and V4 region of 16S rRNA gene?
If so, is it correct if I use pcr.seqs with as parameters start=6388 and end=23444?

Thanks in advance

I think that should be fine
Pat

1 Like