Phylogenetic methods in large dataset

awil · February 3, 2020, 4:04pm

Hi,
I’m doing phylogenetic method for large dataset following Miseq SOP to analyse Faith’s phylogenetic diversity (phylo.diversity command).

However, I got 539G in dist-phylip format. Now, the process of clearcut is running more than 7 days. It’s still not finish. I’m not sure that how should I do for big file in clearcut process. Could you please suggest me?

Thank you very much.

Kendra · February 3, 2020, 4:05pm

how much ram do you have?

awil · February 3, 2020, 4:07pm

I used ram 16G (mem total 757G. When i used processors=16, %mem used in 40%) , the step of process in below;

dist.seqs(fasta=finalforphy.fasta, output=lt, processors=16)
clearcut(phylip=finalforphy.phylip.dist)

However, we had used other server in slurm system (10 nodes, 3TB mem/node). I set memory in 1TB (72 cores), but I have only five days for running job (server usage limitation). The job did not still complete.

Kendra · February 4, 2020, 6:46pm

I believe clearcut needs as much ram as the dist matrix for each processor. But frankly, I’ve never gotten a big dataset to finish clearcut. I just don’t do it.

pschloss · February 5, 2020, 12:38pm

For what it’s worth, we’ve yet to see results from using phylogenetic methods disagree with those from bin-based methods. I would really suggest just using a bin-based method.

Pat

awil · February 5, 2020, 2:57pm

No. problem. I also think that using phylogenetic tree did not optimal method for large dataset. Thank you very much for your suggestion. Normally, I used bin-based method to analyse all data. However, I got some suggestion in phylogenetic method using Faith’s phylogenetic diversity which maybe give the additional insights into data ^^

pschloss · February 6, 2020, 6:15pm

For us, PD is almost exactly correlated with the number of OTUs observed in a sample

Pat

system · February 16, 2020, 6:15pm

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
clearcut running 16+ days Commands in mothur	7	2058	May 2, 2017
Troubles with clearcut in MiSeq SOP Commands in mothur	2	973	November 20, 2017
building really huge trees Theory behind mothur	1	3604	September 25, 2014
Termination of Clearcut command Commands in mothur	2	411	July 23, 2021
Phylogenetic Tree File	1	351	July 25, 2023

Phylogenetic methods in large dataset

Related topics