Merging two mothur runs?

rkubin · August 24, 2019, 6:54pm

I am performing mothur runs on FASTQ files from two separate datasets. I am using one dataset to train an ML model and the other to cross-validate it. To do this, I need to make sure that the features the models are trained on (OTUs) are only the OTUs that both runs have in common. How should I go about this? Should I do one run for everything and then split the OTU table as I please afterwards? Or is there some method to merge two OTU tables, keeping only the OTUs that are in common?

To further complicate things, one dataset is 100 bp single-end reads while the other is 250 bp paired-end reads.

Any ideas?
Thanks

pschloss · August 29, 2019, 12:18pm

Hi there,

At this point, the best option would be to build a single OTU table and then separate things. Combining data with different read lengths (and/or different regions) is generally a bad idea. One option would be to use the phylotype/make.shared commands to bin your sequences at the genus or any other taxonomic level rather than using OTUs.

Pat
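
For readers who want to try the "one table, then separate" route, here is a minimal Python sketch of the splitting step. It is not a mothur command and not something posted in this thread. It assumes a tab-separated shared table whose first three columns are label, Group and numOtus, followed by one count column per OTU. The sample names (A1, A2, B1), the tiny demo table, and the rule "an OTU counts as common if it has a non-zero count in at least one sample of each dataset" are all hypothetical choices made for illustration.

```python
import csv
import io


def read_shared(handle):
    """Parse a shared-style table: label, Group, numOtus, then one column per OTU.

    Assumed layout; check it against your own file before relying on this.
    """
    reader = csv.reader(handle, delimiter="\t")
    header = [h for h in next(reader) if h != ""]  # tolerate a trailing tab
    otus = header[3:]
    counts = {}
    for row in reader:
        row = [c for c in row if c != ""]
        if not row:
            continue
        sample = row[1]
        counts[sample] = dict(zip(otus, (int(c) for c in row[3:])))
    return otus, counts


def split_on_common_otus(otus, counts, train_samples, test_samples):
    """Keep only OTUs observed (count > 0) in both sample sets, then split."""

    def observed(samples):
        return {o for s in samples for o in otus if counts[s][o] > 0}

    common = observed(train_samples) & observed(test_samples)
    keep = [o for o in otus if o in common]  # preserve original column order
    train = {s: [counts[s][o] for o in keep] for s in train_samples}
    test = {s: [counts[s][o] for o in keep] for s in test_samples}
    return keep, train, test


# Hypothetical demo table: A1 and A2 stand in for the training dataset,
# B1 for the validation dataset.
demo = io.StringIO(
    "label\tGroup\tnumOtus\tOtu1\tOtu2\tOtu3\n"
    "0.03\tA1\t3\t5\t0\t2\n"
    "0.03\tA2\t3\t1\t0\t0\n"
    "0.03\tB1\t3\t0\t4\t7\n"
)
otus, counts = read_shared(demo)
keep, train, test = split_on_common_otus(otus, counts, ["A1", "A2"], ["B1"])
print(keep, train, test)
```

Because both subsets are cut from the same table, the feature columns line up by construction. Note that this does nothing about the read-length mismatch described above, so it only makes sense once the sequences have been processed together in a way you trust.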

rkubin · August 29, 2019, 1:46pm

Hi Pat,
Thanks for answering!
I will take this into consideration.
