Hello Pat and Mothur team,
My case is a little bit different from the other folks that I could see on this thread.
I have 26 pairs of fastq files and a different combination of barcodes in each one. Only one barcode combination within each R1-R2 pair is of my interest. The rest is from other projects that I am not involved with.
The following is my oligo file:
############################
$ cat oligos-map4mothur.tab
primer CAGCMGCCGCGGTAATWC CCGTCAATTCCTTTRAGGTT 519F926R
BARCODE ggtac aggaa BfleaM1
BARCODE ggtac gagact NC014M10
BARCODE ggtac cgattcc FOF012M11
BARCODE ggtac tctcaatc DS006M12
BARCODE ggtac gagtgg XfleaM2
BARCODE ggtac ccacgtc FfleaM3
BARCODE ggtac ttctcagc SfleaM4
BARCODE ggtac ctagg NOfleaM5
BARCODE ggtac tgctta SAC023M6
BARCODE ggtac gcgaagt FOF007M7
BARCODE ggtac aatcctat DS024M8
BARCODE ggtac atctg SAC004M9
BARCODE caacac aggaa SAC015M13
BARCODE caacac gagact NC036M22
BARCODE caacac cgattcc SAC001M23
BARCODE caacac tctcaatc SAC002M24
BARCODE caacac gagtgg DS027M14
BARCODE caacac ccacgtc DS021M15
BARCODE caacac ttctcagc DS015M16
BARCODE caacac ctagg DS016M17
BARCODE caacac tgctta DS040M18
BARCODE caacac gcgaagt DS022M19
BARCODE caacac aatcctat NC027M20
BARCODE caacac atctg NC028M21
BARCODE atcggtt aggaa SAC003M25
BARCODE atcggtt gagtgg SAC005M26
############################################
I ran make.contigs from mothur v.1.39.1 like the following:
mothur > make.contigs(file=flea.txt, oligos=oligos-map4mothur.tab, bdiffs=1, pdiffs=2, checkorient=t, processors=7)
It ran without errors. However, by what was printed on the screen, I am afraid it is considering the whole fastq content as my samples, instead of only the sequences containing the barcodes depicted in the oligo file for each sample.
############################################
It took 2327 secs to process 5327581 sequences.
Group count:
BfleaM1 120317
DS006M12 334566
DS015M16 10615
DS016M17 19446
DS021M15 9775
DS022M19 13734
DS024M8 295246
DS027M14 20810
DS040M18 7933
FOF007M7 321275
FOF012M11 261574
FfleaM3 204135
NC014M10 257191
NC027M20 14456
NC028M21 14559
NC036M22 10184
NOfleaM5 304042
SAC001M23 10311
SAC002M24 12102
SAC003M25 26277
SAC004M9 242164
SAC005M26 21569
SAC015M13 12304
SAC023M6 328508
SfleaM4 308685
XfleaM2 241776
Total of all groups is 3423554
Output File Names:
flea.trim.contigs.fasta
flea.trim.contigs.qual
flea.contigs.report
flea.scrap.contigs.fasta
flea.scrap.contigs.qual
flea.contigs.groups
[WARNING]: your sequence names contained ‘:’. I changed them to ‘_’ to avoid problems in your downstream analysis.
####################################################
Can anyone shed a light on how mothur would consider only the sequences with the target-barcodes (on the oligo file) within each fastq pair?
JFYI
I've also tried running for one sample only and got the following error:
##########################################################
mothur > make.contigs(ffastq=M1_S1_L001_R1_001.fastq.gz, rfastq=M1_S1_L001_R2_001.fastq.gz, oligos=oligos-map4mothur-M1.tab, bdiffs=1, pdiffs=2, checkorient=t, processors=7)
Using 7 processors.
x�vyK�b���٥sӪ��’%AR1��n�LI����bØѶ is in your forward fastq file and not in your reverse file, please remove it using the remove.seqs command before proceeding.
is in your forward fastq file and not in your reverse file, please remove it using the remove.seqs command before proceeding.
�l*���+=������V��u���g�55S�6v^r�ui�����C������csS%��!�i0�?�ttQTQ��j���1�"�S"!�<�J�8�5G�B$�X<'R��h is in your forward fastq file and not in your reverse file, please remove it using the remove.seqs command before proceeding.
p4�����"� is in your forward fastq file and not in your reverse file, please remove it using the remove.seqs command before proceeding.
�Yʖ͖f�r�덼�9x��I��$�G’m�t&���~ is in your forward fastq file and not in your reverse file, please remove it using the remove.seqs command before proceeding.
�-2�J���i-$ is in your forward fastq file and not in your reverse file, please remove it using the remove.seqs command before proceeding.
��’ is in your forward fastq file and not in your reverse file, please remove it using the remove.seqs command before proceeding.
##########################################################
Here’s the oligo file for the M1 sample only:
$ cat oligos-map4mothur-M1.tab
primer CAGCMGCCGCGGTAATWC CCGTCAATTCCTTTRAGGTT 519F926R
BARCODE ggtac aggaa BfleaM1
BARCODE tgattgac tctcaatc Blank
BARCODE ggtac tctcaatc D1
BARCODE caacac tctcaatc D13
BARCODE atcggtt tctcaatc D25
Looking forward to hear from you,
Thanks very much,
Elton