summary.seqs with fastq to add qual score summary

ejbiers · September 26, 2013, 9:23pm

Always love your mothur!

Would it be relatively easy to add a feature to summary.seqs that allows a fastq input file? I’d love to use that command with a fastq file to get an idea of the range of quality scores directly in the logfile. Something simple like the low and high quality score would be great, but maybe something more complex like a summary of the average quality scores… or the average length at which QS remains above a certain threshold…

Maybe something like this(hypothetical numbers) where QS shows the range of QS over individual reads, aveQS shows the range of average QS, and LenQS shows the range of read lengths at which the base quality score remains above a certain threshold.

[img]Start End NBases Ambigs Polymer NumSeqs QS aveQS LenQS
Minimum: 1 35 35 0 2 1 0 1
2.5%-tile: 1 147 147 0 4 59794 3 100
25%-tile: 1 151 151 0 4 597933 25 130
Median: 1 168 168 0 4 1195866 30 130
75%-tile: 1 190 190 0 5 1793798 30 140
97.5%-tile: 1 193 193 14 6 2331937 35 160
Maximum: 1 303 302 199 151 2391730 40 190
Mean: 1 169.435 169.435 1.44417 4.4614 30.275 130.475

of Seqs: 2391730[/img]

Thoughts? Thanks!

westcott · September 27, 2013, 10:32am

You might like the summary.qual command. http://www.wiki.mothur.org/wiki/Summary.qual

Topic		Replies	Views
Quality summary statistics Commands in mothur	1	2306	October 4, 2011
Bug in trim.seqs mothur bugs	5	62995	December 27, 2012
summary.seqs to evaluate fastq files Feature requests	2	3729	March 18, 2013
Can ''qwindowaverage'' help me ? Commands in mothur	3	2577	September 17, 2014
Expecting quality scores to be les than 40 mothur bugs	3	467	January 31, 2021

summary.seqs with fastq to add qual score summary

of Seqs: 2391730[/img]

Related topics