Hello,
I’ve sequenced a mixture of PCR products using 454 titanium.
The PCR products were synthesised using both archaea and bacteria primers (each pair had individual barcodes) and the 454 adapters were later ligated.
As a result, I ended up with an output file in which each sequence contains one the 4 primers and some sequences contain both primers of one domain.
As I understood, preparing the oligos file in this format:
forward CATGCTGCCTCCCGTAGGAGT
reverse TCAGAGTTTGATCCTGGCTCAG
barcode AACCAACC ALP50M
barcode AACCAAGG AZAC1
barcode AACCATCG ALP2B
barcode AACCATGC ALP1B
barcode AACCGCAT ALP80M
barcode AACCGCTA ALPG2
barcode AACCGGAA AZ273
…
and parsing it to trim.seqs() will cause mothur to search for files that contain the forward AND the reverse and throw away those that contain only one of them (or none of them).
I’m looking for an option to parse an oligo file in the following format:
bac_forward CATGCTGCCTCCCGTAGGAGT
bac_reverse TCAGAGTTTGATCCTGGCTCAG
arc_forward TCGYGCASCAGKCGMGAAW
arc_reverse CAGGACTACVSGGGTATCTAAT
barcode AACCAACC ALP50M
…
And have mothur process it as: find bac_forward AND/OR bac_reverse OR arc_forward AND/OR arc_reverse.
So that mothur will recognise all sequences containing only bac_forward OR only bac_reverse OR bac_forward AND bac_reverse (in revcom form) OR bac_forward (in revcom form) AND bac_reverse …
Is that possible in any way?
Even processing only one pair at a time will do.
how do people normally deal with sequences produced using ligated adapters?
Thanks in advance,
Roey