When I run the following command:
filter.seqs(fasta=H51WFGV01.shhh.trim.unique.good.align, vertical=T, trump=., processors=8)
I get the following output:
Length of filtered alignment: 0
Number of columns removed: 50000
Length of the original alignment: 50000
Number of sequences used to construct filter: 5089
The input align file looks OK. Any ideas on what’s happening?
Your sequences are not overlapping, over 75% end before the remainder start. What parameter options did you run with the screen.seqs command? Can you try: screen.seqs(fasta=yourFasta, name=yourName, group=yourGroup, optimize=start), and post the results of the summary.seqs command again?
Want to try screen.seqs(fasta=H51WFGV01.shhh.trim.unique.align, name=H51WFGV01.shhh.trim.unique.names, group=H51WFGV01.shhh.groups, optimize=start-end, processors=8)? Can you also post the summary.seqs results on the H51WFGV01.shhh.trim.unique.fasta file?
The results with optimize=start-end are overlapping and filter.seqs should no longer remove all your sequences. I asked for the wrong file in my last post, I meant the summary.seqs output for fasta=H51WFGV01.shhh.trim.unique.align. Looking at the results from the summary.seqs command may help us pick a better start and end value.
Length of filtered alignment: 0
Number of columns removed: 50000
Length of the original alignment: 50000
Number of sequences used to construct filter: 18684
I think you are having a problem with your alignment. Are you sure that your sequences are oriented in the correct direction? You might try doing flip=T in align.seqs and try again.