Hi mothur community,
Many thanks for maintaining the forum, and tool. I’m a new guy to mothur, and 16s data. Kindly bear with me and my naive questions. :oops:
I’ve illumina- MiSeq, dual indexed, paired end, V3-V4 region, demultiplexed reads. They do not have bar codes in them. I’m yet to receive linker/primer information. Read length is 301.
I took a step ahead with one sample. created stability file, ran mothur. So far so good.
Command ran:
mothur "#summary.seqs(fasta=stability.trim.contigs.fasta)"
Start End NBases Ambigs Polymer NumSeqs
Minimum: 1 35 35 0 3 1
2.5%-tile: 1 443 443 0 4 2597
25%-tile: 1 446 446 0 4 25961
Median: 1 464 464 2 5 51922
75%-tile: 1 464 464 6 5 77883
97.5%-tile: 1 598 598 22 7 101247
Maximum: 1 602 602 45 300 103843
Mean: 1 463.875 463.875 4.49172 4.84329of Seqs: 103843
Clearly “602” didn’t assemble.
screen.seqs(fasta=stability.trim.contigs.fasta, group=stability.contigs.groups, maxambig=0, maxlength=275)
Questions:
- How do I decide the max length parameter I should be using in next step to filter out my sequences?
- Does Maximum: 103843 (NumSeqs) mean I have 10k-ish sequences with length 602?
- Has Mothur added feature to take in .fastq.gz files?
I’ll be grateful to you for helping me proceed.