What is the meaning of increasing numbers from classify.seqs?

weu · May 12, 2021, 1:10pm

Hi, I am running classify.seqs with the edited EzBioCloud V3-V4 reference sequences, but classification is taking a very long time (more than 24 hours so far) and the numbers shown are still increasing. What does this mean? Is there a way to predict the duration? If my laptop goes to sleep, is the process interrupted? Thank you.

pschloss · May 13, 2021, 5:46pm

Hi,

How many unique sequences do you have? If you’re sequencing V3-V4 or aren’t following our MiSeqSOP, you might want to check out this post…

Pat

weu · May 14, 2021, 4:04am

There are 194520 sequences classified by the classify.seqs command. I'm using a MacBook Air with the M1 chip, but couldn't find any comparison of RAM or other hardware requirements for mothur analysis. I followed the SOP as far as I could understand it, and added large=T as well as setting taxlevel=3. What are the different dist files that are produced used for?

pschloss · May 14, 2021, 4:51pm

That’s a lot of uniques. What region are you sequencing? I suspect you have so many because of low sequence quality from sequencing V3-V4 or V4-V5 (i.e. something longer than ~250 nt). I certainly wouldn’t try to process this on a laptop. The large=T option should not be used, and you probably want to use taxlevel=5 or 6 and possibly increase the diffs value in pre.cluster to the integer that is just less than your sequence length divided by 100.

Pat
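
The diffs rule of thumb in the post above can be sketched in a few lines of Python. This is only an illustration: the helper name `suggested_diffs` is hypothetical and not part of mothur, and it assumes "the integer that is just less than" means rounding length / 100 down to a whole number.

```python
def suggested_diffs(seq_length_nt: int) -> int:
    """Return a candidate diffs value for pre.cluster.

    Assumption: the rule of thumb is read as floor(length / 100),
    i.e. roughly one allowed difference per 100 nt of sequence.
    """
    if seq_length_nt <= 0:
        raise ValueError("sequence length must be a positive number of nt")
    return seq_length_nt // 100


if __name__ == "__main__":
    # Example lengths are illustrative, not taken from the thread.
    for length in (253, 420, 465):
        print(f"{length} nt -> diffs={suggested_diffs(length)}")
```

The resulting integer is what would be passed as the diffs option of pre.cluster; treat it as a starting point rather than a fixed setting.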

weu · May 15, 2021, 2:03pm

Yes, it's the V3-V4 region. Many thanks for all the great suggestions. Everything worked fine after increasing diffs to 5 and changing taxlevel to 6. I also changed the reference database to the SILVA seed instead of the larger SILVA reference sequences. Does the size of the reference database matter?

pschloss · May 17, 2021, 5:25pm

The seed version should be good enough for most purposes.

Pat

system · May 27, 2021, 5:26pm

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.
