filter.seqs removing sequences from the fasta file?

svazquez · August 1, 2014, 10:55am

Hi

I am processing a downsampled dataset to find the best parameters for my analysis.
After running screen.seqs I ended up with:

of unique seqs: 8794

total # of seqs: 10129

Then, I ran
mothur > filter.seqs(fasta=454downsample.shhh.trim.unique.good.align, vertical=T, trump=., processors=2)

and the output was:
Length of filtered alignment: 1219
Number of columns removed: 48781
Length of the original alignment: 50000
Number of sequences used to construct filter: 7372

Someone can explain why filter.seqs started with 8749 unique seqs but ended up in a fasta file with 7372 seqs? I thought that filter.seqs would only remove common gaps and missing data but not sequences!

Thanks!

pschloss · August 1, 2014, 8:32pm

I suspect the data you posted were from running summary.seqs on 454downsample.shhh.trim.unique.align and not 454downsample.shhh.trim.unique.good.align

Pat

Topic		Replies	Views
filter.seqs Commands in mothur	4	3922	May 31, 2012
filter.seqs removes all data Commands in mothur	10	6749	January 25, 2016
problems with filter.seqs Commands in mothur	3	2160	March 26, 2015
Filter.seqs what kind of numbers should be being removed? Commands in mothur	2	361	March 14, 2021
Problem with filter.seqs - Length of filtered alignment: 0 Commands in mothur	4	338	June 22, 2023

filter.seqs removing sequences from the fasta file?

of unique seqs: 8794

Related topics