Hi,
I recently received data back from our first run switching from the Nextera i5 and i7 indexes to the IDT UD indexes from Illumina and have a problem when i run make.contigs.
We have been using mothur for a long time with the old indexes and get a consistent amplicon length of ~260 bp using V4 primers.
Our new run (2 runs on separate flow cells on different days of the week sequenced using XLEAP for the first time ~Q38 quality) ends up with contigs that are 360bp.
I wrote a script that matches the 260bp from an old data set to the same sequences in the new data and found the position of the 260bp to start after 50 bases and end 50 bases before the end of the sequence.
Looking at what these 50 bases are I can see the adapter sequences and the index sequences and few extra bases at the start and end.
I tried using trimoverlap=T but this only takes the extra bases off not the adapters and the contigs end up still at 360bp. I have tried to get cutadapt to trim before processing the files but cant seem to get that to work correctly.
Does anyone have any ideas why mothur used to process these quite happily and now I have these extra sequences in, is it something to do with the UD indexes? There are a lot of polymer sequences (as you would expect) when these arent removed.
Thanks!