To whom it may concern,
Background
I am interested in how the number of OTUs and the taxonomic resolution change as the amount of quality trimming at the ends of sample reads changes. I speculate that more trimming from the read ends will result in more spurious OTUs and lower taxonomic resolution.
I wanted to do this with data that should, in theory, contain the same sequences as my reference database (and share the assumptions about how those sequences were generated). Since I typically use Green Genes, I randomly sampled sequences from Green Genes and mapped quality scores onto them that drop n nucleotides from the end of both the forward and reverse reads. I will iterate from n=0 through n=N, where N is the length of the longest sequence in Green Genes. Sequences shorter than n will be removed from downstream analysis.
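For concreteness, the simulation described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the exact script used: it assumes that "dropping" n nucleotides means giving the last n bases of each read a low Phred score, that reads span the full reference sequence, and that qualities are Phred+33 encoded. The names `HIGH_Q`, `LOW_Q`, `simulate_pair`, and `simulate_sample` are hypothetical.

```python
import random

HIGH_Q = 40  # assumed Phred score for bases that are kept
LOW_Q = 2    # assumed Phred score for the last n bases of each read

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Reverse-complement a DNA sequence (characters other than ACGT are left as-is)."""
    return seq.translate(_COMPLEMENT)[::-1]


def quality_string(length, n):
    """Phred+33 quality string: high quality, then n low-quality bases at the 3' end."""
    return chr(HIGH_Q + 33) * (length - n) + chr(LOW_Q + 33) * n


def simulate_pair(name, seq, n):
    """Return (forward, reverse) FASTQ records, or None if seq is shorter than n."""
    if len(seq) < n:
        return None  # sequences shorter than n are removed
    qual = quality_string(len(seq), n)
    fwd = f"@{name}/1\n{seq}\n+\n{qual}\n"
    rev = f"@{name}/2\n{reverse_complement(seq)}\n+\n{qual}\n"
    return fwd, rev


def simulate_sample(references, n, sample_size, seed=0):
    """Randomly sample reference sequences and build read pairs for one value of n."""
    rng = random.Random(seed)
    items = sorted(references.items())
    chosen = rng.sample(items, min(sample_size, len(items)))
    pairs = (simulate_pair(name, seq, n) for name, seq in chosen)
    return [pair for pair in pairs if pair is not None]
```

Calling `simulate_sample(references, n, sample_size)` once per value of n, with `references` being a dict of sequence name to sequence, would give one simulated sample per trimming level.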
Description of Problem
I have encountered an issue with make.contigs(): it gives me several warnings of the form "One of your candidate sequences is longer than you longest template sequence. Your longest template sequence is 1000. […]", and then it ends with a Segmentation fault.
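To see which reads trigger the warning, a small check like the one below could be run on the input FASTQ files. It is a minimal sketch, assuming plain four-line FASTQ records; the function name `reads_longer_than` is hypothetical, and the default limit of 1000 is simply the value quoted in the warning text.

```python
def reads_longer_than(fastq_lines, limit=1000):
    """Return (name, length) for each FASTQ record whose sequence exceeds limit.

    Assumes plain four-line FASTQ records (no wrapped sequence lines).
    """
    lines = [line.rstrip("\n") for line in fastq_lines]
    too_long = []
    for i in range(0, len(lines) - 3, 4):
        header, seq = lines[i], lines[i + 1]
        if len(seq) > limit:
            too_long.append((header[1:], len(seq)))  # drop the leading '@'
    return too_long
```

An open file handle can be passed directly, since iterating over it yields lines.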
Question
Does this indicate that make.contigs() will make template sequences no longer than 1000 nt, even when candidate sequences are longer?
Log output
mothur > make.contigs(file=stability7.files, processors=8)
Using 8 processors.
Reading fastq data...
10000
20000
30000
40000
50000
60000
70000
80000
90000
100000
Done.
>>>>> Processing insilico_sample7_100000seqs_drop0_L001_R1_001.variable_cut.0ffastatemp (file 1 of 1) <<<<<
Making contigs...
One of your candidate sequences is longer than you longest template sequence. Your longest template sequence is 1000. Your candidate is 1366.
One of your candidate sequences is longer than you longest template sequence. Your longest template sequence is 1000. Your candidate is 1353.
One of your candidate sequences is longer than you longest template sequence. Your longest template sequence is 1000. Your candidate is 1355.
One of your candidate sequences is longer than you longest template sequence. Your longest template sequence is 1000. Your candidate is 1358.
One of your candidate sequences is longer than you longest template sequence. Your longest template sequence is 1000. Your candidate is 1467.
One of your candidate sequences is longer than you longest template sequence. Your longest template sequence is 1000. Your candidate is 1504.
One of your candidate sequences is longer than you longest template sequence. Your longest template sequence is 1000. Your candidate is 1390.
One of your candidate sequences is longer than you longest template sequence. Your longest template sequence is 1000. Your candidate is 1453.
Segmentation fault