Steps needed to repeat for the analysis of the subset

irsat · May 29, 2022, 1:46pm

Hello everyone,

I want to pick a subset of a larger dataset that I have already profiled (clustering, OTU annotation, etc.) for another analysis. Should I simply pull the profiles of the subset from the existing profile file of the larger dataset, or should I repeat the profiling for the subset samples? If I need to repeat it, from which step should I start: clustering, chimera removal, or all the way back at the make.contigs step?

Thank you

pschloss · May 31, 2022, 7:30pm

I’m not 100% sure what you mean, but if you are going to compare these sequences/OTUs to another set of sequences/OTUs then you’ll want to go back before the align.seqs step.

Pat

irsat · June 1, 2022, 12:38pm

Hi Pat,
Thank you for the reply. Let’s say I have 6 profiled samples in total, separated into two groups: A, B, C versus D, E, F. Now I want to compare A, B versus D, E, F, or A, B versus E, F. All the new comparisons use samples from the initially profiled set; no new samples are introduced.
Best

By the way, do you think the results from mothur 1.40 differ much from those from mothur 1.48?

pschloss · June 1, 2022, 3:50pm

As I understand it, you’ve already processed samples A, B, C, D, E, and F together and want to make comparisons between those samples. There shouldn’t be a problem extracting the information from summary.single, dist.shared, etc. when those functions are performed on the full dataset.
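A minimal sketch of that kind of extraction, assuming the counts live in a tab-delimited mothur shared file whose first columns are label, Group and numOtus. The function name `subset_shared` and the toy table are hypothetical and only illustrate the idea of keeping rows for the samples of interest.

```python
import csv
import io


def subset_shared(shared_text, keep_groups):
    """Keep only the rows of a shared-style table whose Group is wanted."""
    reader = csv.DictReader(io.StringIO(shared_text), delimiter="\t")
    return [row for row in reader if row["Group"] in keep_groups]


# Hypothetical toy table; a real shared file has many more OTU columns.
SHARED = (
    "label\tGroup\tnumOtus\tOtu0001\tOtu0002\tOtu0003\n"
    "0.03\tA\t3\t10\t0\t5\n"
    "0.03\tB\t3\t7\t2\t1\n"
    "0.03\tC\t3\t0\t9\t4\n"
    "0.03\tD\t3\t3\t3\t8\n"
    "0.03\tE\t3\t6\t1\t0\n"
    "0.03\tF\t3\t2\t5\t5\n"
)

# Second comparison from the thread: A, B versus E, F.
subset = subset_shared(SHARED, {"A", "B", "E", "F"})
print([row["Group"] for row in subset])
```

Within mothur itself, the get.groups command is meant for this kind of selection; check its documentation for the parameters available in your version.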

I always encourage people to update their versions of mothur. The one you’re using is a few years old at this point.

Pat

irsat · June 1, 2022, 5:17pm

Thank you a lot. This is very helpful.

I was using the Galaxy platform, which is very nice for organizing jobs and files. That is why I used the old version, which is the one available in Galaxy.

Good day

Alexandre_Thibodeau · June 6, 2022, 12:54pm

Hello!
Well, I would simply add a grouping variable to the samples you want to compare. In R this is relatively easy, but I do not know how it is done in Galaxy.
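A rough sketch of the grouping-variable idea in Python, assuming one value per sample (for example a diversity index taken from the full-dataset output). The numbers, the labels `g1`/`g2`, and the helper `group_means` are all made up for illustration.

```python
from statistics import mean

# Hypothetical per-sample values computed once on the full dataset.
shannon = {"A": 2.1, "B": 2.4, "C": 2.0, "D": 3.0, "E": 3.2, "F": 2.8}


def group_means(values, grouping):
    """Average the per-sample values within each group of a grouping variable."""
    by_group = {}
    for sample, group in grouping.items():
        by_group.setdefault(group, []).append(values[sample])
    return {group: mean(vals) for group, vals in by_group.items()}


# Each comparison is just a different grouping variable over the same samples;
# samples left out of the mapping are ignored.
a_b_vs_d_e_f = {"A": "g1", "B": "g1", "D": "g2", "E": "g2", "F": "g2"}
a_b_vs_e_f = {"A": "g1", "B": "g1", "E": "g2", "F": "g2"}

print(group_means(shannon, a_b_vs_d_e_f))
print(group_means(shannon, a_b_vs_e_f))
```

Nothing is reprocessed here: changing the comparison only changes the mapping from sample to group.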

irsat · June 6, 2022, 1:49pm

Hi Alexandre,

Thank you for your reply. You are right. If only taxonomic abundance is considered, the profiles of the samples do not differ much when the whole process is repeated (starting from make.contigs) for the smaller set; I saw a difference of only 0.0001 at the phylum level when I tried. So one can just add a grouping variable to compare taxonomic profiles between groups. The alpha diversity, however, differed significantly. Right now, I am trying to figure out which step is responsible for that difference.

best

Irsat
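The thread does not identify which step causes the alpha diversity difference. One candidate worth ruling out, offered here as an assumption and not a finding, is the subsampling depth: if a reprocessed subset is rarefied to a different number of reads than the full dataset, observed richness can change even when the underlying community is the same. The sketch below uses made-up OTU counts and a hypothetical helper, `rarefy_richness`, to show the effect.

```python
import random


def rarefy_richness(counts, depth, seed=0):
    """Count distinct OTUs after drawing `depth` reads without replacement."""
    # Expand the count vector into one entry per read, labelled by OTU index.
    reads = [otu for otu, n in enumerate(counts) for _ in range(n)]
    drawn = random.Random(seed).sample(reads, depth)
    return len(set(drawn))


# Hypothetical sample: a few dominant OTUs and several rare ones.
sample_counts = [50, 30, 10, 5, 2, 1, 1, 1]

full_depth = rarefy_richness(sample_counts, sum(sample_counts))
shallow = rarefy_richness(sample_counts, 20)
print("richness at full depth:", full_depth)
print("richness at 20 reads:", shallow)
```

Comparing the subsampling size used in each run is therefore a cheap first check before looking at clustering or chimera removal.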

system · June 16, 2022, 1:50pm

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.
