Align sequence with no primers

allan_santos · February 26, 2024, 4:33pm

Dear all,
Can I run align.seqs on samples whose primers and adapters have already been removed (we sequenced at BGI, and they provided CleanData without primers), using a SILVA reference that excludes the primer sequences?
I'm following the steps in the SOP.
I ran it, and summary.seqs returns this:

mothur > align.seqs(fasta=luan.trim.contigs.good.unique.fasta, reference=silva.nr_v138_1.pcr.align)

Using 64 processors.

Reading in the silva.nr_v138_1.pcr.align template sequences… DONE.
It took 19 to read 146601 sequences.

Aligning sequences from luan.trim.contigs.good.unique.fasta …
It took 36 secs to align 117429 sequences.

[WARNING]: 3 of your sequences generated alignments that eliminated too many bases, a list is provided in luan.trim.contigs.good.unique.flip.accnos.
[NOTE]: 1 of your sequences were reversed to produce a better alignment.

It took 37 seconds to align 117429 sequences.

Output File Names:
luan.trim.contigs.good.unique.align
luan.trim.contigs.good.unique.align_report
luan.trim.contigs.good.unique.flip.accnos

mothur > summary.seqs(fasta=current, count=luan.trim.contigs.good.count_table)
Using luan.trim.contigs.good.unique.align as input file for the fasta parameter.

Using 64 processors.

            Start  End   NBases  Ambigs  Polymer  NumSeqs
Minimum:    1      18    7       0       3        1
2.5%-tile:  1      9583  253     0       4        26699
25%-tile:   1      9583  253     0       5        266984
Median:     1      9583  253     0       5        533967
75%-tile:   1      9583  253     0       6        800950
97.5%-tile: 1      9583  253     0       6        1041235
Maximum:    8716   9583  270     0       8        1067933
Mean:       1      9582  252     0       5

# of unique seqs: 117429
total # of seqs: 1067933

pschloss · February 27, 2024, 6:55pm

Hi Allan,

That looks right. You’ll want to run screen.seqs next to remove the short sequences.
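
A minimal sketch of that next step, using the file names from the commands above. The `start=1` and `end=9583` thresholds are an assumption read off the 2.5%-tile through 97.5%-tile rows of the posted summary; adjust them to match your own output:

```
# keep only sequences that start at or before position 1 and end at or after position 9583
screen.seqs(fasta=luan.trim.contigs.good.unique.align, count=luan.trim.contigs.good.count_table, start=1, end=9583)

# re-check the alignment coordinates after screening
summary.seqs(fasta=current, count=current)
```

Sequences that start after the `start` position or end before the `end` position are dropped, which is what removes the short reads such as the 7-base minimum in the summary.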

Pat

allan_santos · February 27, 2024, 8:38pm

Hi Pat,
Thanks for replying.
So I can't proceed with align.seqs without the primers when using this 'clean data'? Should I run align.seqs with the primers and then remove them with screen.seqs?

Thanks

pschloss · March 7, 2024, 1:39pm

Can you remove the primers and barcodes in make.contigs? You’ll definitely want to know which sample each sequence belongs to.

Pat
