filter.seqs removes every column

Chris · December 20, 2011, 9:51am

We have sequenced 16S amplicons via Illumina platform and are now analysing our sequences. In filter.seqs step every column is removed. I have tried different criterias in screen.seqs, but it still ends the same. For now I have only used vertical=T parameter, but it would be great to remove dots also. Is there anything that I should try or keep in mind?

Thanks,
Chris

pschloss · December 22, 2011, 3:23pm

Can you post the results of summary.seqs for the input to screen.seqs?

Chris · January 3, 2012, 8:53am

My input to screen.seqs:

Start End NBases Ambigs Polymer NumSeqs
Minimum: 0 0 0 0 1 1
2.5%-tile: 1044 1056 5 0 2 1045
25%-tile: 21917 22545 48 0 3 10446
Median: 31189 34102 74 0 3 20891
75%-tile: 31189 34113 75 0 3 31336
97.5%-tile: 42573 43061 77 0 4 40737
Maximum: 43116 43116 128 0 8 41781
Mean: 26207.9 28138.7 60.867 0 3.17367

of Seqs: 41781

95% of the sequences are between 74-76 bases, so these are quite short and I’m not so sure that aligning against Silva reference alignment is the right choice here. Maybe Greengenes would be better?

pschloss · January 3, 2012, 1:55pm

The problem is that your sequences do not overlap with each other - I doubt greengenes will be better. I’d suggest using start=31189, end=34000 in screen.seqs and then running filter.seqs.

Chris · January 9, 2012, 7:50am

Thank you for the advice, it helped a lot. But another question, what reference database for classification would be the best to use for so short sequences?

pschloss · January 10, 2012, 6:50pm

Give the RDP trainset or the greengenes one a shot. Just don’t expect them to classify your data all the way to family or genus.

Topic		Replies	Views
Loss of bases with filter.seqs Commands in mothur	1	2145	February 22, 2012
filter.seqs removes every columns Commands in mothur	1	1432	February 8, 2016
filter.seqs removes all data Commands in mothur	10	6747	January 25, 2016
filter.seqs Commands in mothur	4	4010	August 2, 2012
filter.seqs error mothur bugs	3	1164	January 30, 2017

filter.seqs removes every column

of Seqs: 41781

Related topics