loss of diversity due to homogenization of sequences

Johannes · March 27, 2014, 12:08pm

I ran unique.seqs before aligning, screening, and filtering a concatenated fasta file in order to build a phylogeny. RAxML, which I used, flagged a bunch of sequences as identical to other sequences in the file. I didn’t anticipate this, since I had already run unique.seqs. It is worth mentioning that I use clone library sequences, where each sequence corresponds to an individual GenBank entry… I guess that after the screening and filtering, some of the remaining overlapping sequences had been truncated at the same position, making them identical.

…I’m just thinking about the loss of true diversity caused by this homogenization… But I guess it’s the price to pay if you want a somewhat accurate phylogeny.

adamc83 · March 28, 2014, 12:20am

You should run unique.seqs() after every filter.seqs() that can truncate ends, since trimming can make previously distinct sequences identical. I think several steps of the SOP assume you have uniques only.
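
To illustrate why re-running the uniquing step matters, here is a minimal Python sketch. It is not mothur's implementation: the clone names, sequences, and the trimmed column window are all made up for illustration, and `dedupe` and `truncate` are hypothetical helpers that only mimic the idea of unique.seqs and of a filter that cuts every sequence to the same columns.

```python
from collections import defaultdict


def dedupe(seqs):
    """Group sequence names by identical sequence string.

    Conceptually what a uniquing step does; not mothur's actual code.
    """
    groups = defaultdict(list)
    for name, seq in seqs.items():
        groups[seq].append(name)
    return dict(groups)


def truncate(seqs, start, end):
    """Keep only alignment columns [start, end) of every sequence."""
    return {name: seq[start:end] for name, seq in seqs.items()}


# Hypothetical aligned clone sequences, all distinct at full length.
aligned = {
    "cloneA": "ACGTACGTAA",
    "cloneB": "TCGTACGTAC",  # differs from cloneA only at the two ends
    "cloneC": "ACGTTCGTAA",  # differs from cloneA in the middle
}

before = dedupe(aligned)

# Trimming to the shared columns removes the end differences.
trimmed = truncate(aligned, 1, 8)
after = dedupe(trimmed)

print(len(before), "unique sequences before trimming")
print(len(after), "unique sequences after trimming")
for seq, names in after.items():
    print(seq, names)
```

In this toy data, cloneA and cloneB only differ in the columns that get trimmed away, so they collapse into one unique sequence afterwards, while cloneC stays separate because its difference lies inside the retained region.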
