Most sequences end up in the scrap fasta file

lemv24 · January 27, 2020, 5:44pm

Hello, good morning.
I am helping a colleague to analize MiSeq data. I have 12 libraries encompassing 50 pooled samples each. The data was generated targeting the V3-V4 (341F - 805R) region and using barcodes to mix the experimental samples.

I built the oligos file for each of the libraries since the SOP established at the lab, so I did the make.contigs step for each individual library. My problem is that once the command runs, a varying % of sequences are demultiplexed and merged into contigs successfully, but most of them end in the scrap file (e.g: 5224 seqs in contigs.fasta vs 1392786 seqs in scrap; 25131 vs 1616841, …). So I intuit the oligos file works since the alignment of reads is being effective for some of the 50 samples per pool.

Both primers and barcodes are paired. Here is an example of how my oligos file looks like:

Screen Shot 2020-01-27 at 5.36.00 PM

Interestingly, for 10 out of the 12 pools, barcode #50 is the one that has more success making contigs. Do you think it could be an issue with how I built the oligos file? I ran the following command:

make.contigs(ffastq=/home/lemv/fastq/FASTQ/Mos1_S4_L001_R1_001.fastq,rfastq=/home/lemv/fastq/FASTQ/Mos1_S4_L001_R2_001.fastq, oligos=/home/lemv/Oligos/oligosMos1.file, checkorient=t, processors=16)

When I check the scrap codes generated, it seems like the reads are not aligned cause of missmatches in both barcodes and primers:

Ruk2_S2_L001_R1_001.scrap.contigs.fasta

M00485_502_000000000-CFK5Y_1_1113_15046_22480 | bf(bf) ee=1.21853 fbdiffs=1000(noMatch), rbdiffs=1000(noMatch) fpdiffs=16(noMatch), rpdiffs=1002(noMatch

I was considering using the strategy suggested in the following thread:
https://forum.mothur.org/t/all-sequences-in-scrap-after-make-contigs/20253

I don’t think the sequencing facility used linkers or adapters, so I dont know why some barcode-primer combinations are working (barcode # 50 consistently being the best). Could it be a problem with the quality of the reads and not a computational mistake on my side?

Many thanks for the help!

Luis

westcott · January 27, 2020, 8:11pm

Hi Luis,
Welcome to the mothur community! I would like to help you resolve this issue. Could you send your input files and logfile to mothur.bugs@gmail.com so I can take a closer look?
Thanks,
Sarah

lemv24 · January 30, 2020, 5:38pm

Hello Sarah, good morning. I have sent the files to the gmail address. Please let me know if you are able to help me figure out where I may be getting the demultiplexing step wrong.

Thanks!!

Luis

system · February 9, 2020, 5:38pm

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Problem for demultiplexing Miseq Paired end reads using make.contigs with oligos file Commands in mothur	3	590	May 20, 2019
barcodes and primers only partially removed during make.contigs mothur bugs	4	1526	March 13, 2018
Problem with Make.contigs Commands in mothur	9	5455	January 14, 2016
Three small issues with make.contigs() on MiSeq data Commands in mothur	7	8562	June 13, 2013
make.contigs ERROR... Attempt at demultiplexing unsuccessful Commands in mothur	1	3101	November 19, 2014

Most sequences end up in the scrap fasta file

Related topics