Joining two mothur work flows, is it possible?

toneta · July 13, 2017, 7:30am

I have two data sets for a large number of samples (ca. 350): one from sequencing DNA and the other from sequencing cDNA from the same samples. I want to be able to compare the two datasets at the OTU level, so I at least need to cluster all samples together. However, as the DNA samples have already been processed through mothur, I’m wondering whether I have to start from the very beginning with all 700 samples (DNA and cDNA) or whether it is possible to merge the analyses at some point. Does anyone have any clever ideas on how to handle this? Happy for any suggestions.
Best,
Tone

pschloss · July 13, 2017, 11:30am

You would need to do them together. Be forewarned that the process of generating cDNA probably has a very high error rate relative to your PCR and sequencing error rates.

Pat

Kendra · July 21, 2017, 2:48pm

You can process each dataset independently up through chimera checking, then cat each pair of files together (cat cDNA.fasta DNA.fasta…)

toneta · July 27, 2017, 10:07am

Thank you! I will try that one - it will save a lot of work and time.

mniku · August 10, 2017, 11:08am

Hi, I’m also interested in this. I don’t quite understand how we can join the data using just cat. For the fasta files, yes of course, but how can we join the count tables? If I understand correctly, a count table holds the essential information on how many times each sequence was found in each sample. Hmmmm, or could I just transpose it, so that samples are in rows instead of columns?
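
To make the question concrete: a count table is tab-separated text with one row per representative sequence and one column per sample, so joining two of them looks more like an outer join on sequence name than a cat or a transpose. Below is a minimal Python sketch of that idea. It assumes the plain, uncompressed layout (Representative_Sequence, total, then one column per sample); the function names and example sample names are made up for illustration, and it is not a description of what mothur does internally. Note that it matches rows by sequence name only.

```python
import csv
import io


def read_count_table(text):
    """Parse count-table text into (samples, {seq_name: {sample: count}})."""
    reader = csv.reader(io.StringIO(text), delimiter="\t")
    header = next(reader)
    samples = header[2:]  # columns after Representative_Sequence and total
    counts = {}
    for row in reader:
        if not row:
            continue  # skip blank lines
        counts[row[0]] = {s: int(c) for s, c in zip(samples, row[2:])}
    return samples, counts


def merge_count_tables(table_a, table_b):
    """Outer-join two parsed tables on sequence name, filling gaps with 0."""
    samples_a, counts_a = table_a
    samples_b, counts_b = table_b
    samples = samples_a + [s for s in samples_b if s not in samples_a]
    merged = {}
    for counts in (counts_a, counts_b):
        for name, per_sample in counts.items():
            row = merged.setdefault(name, dict.fromkeys(samples, 0))
            for sample, count in per_sample.items():
                row[sample] += count
    return samples, merged


def format_count_table(samples, counts):
    """Write the merged table back out, recomputing the total column."""
    lines = ["\t".join(["Representative_Sequence", "total"] + samples)]
    for name, row in counts.items():
        values = [row[s] for s in samples]
        lines.append("\t".join([name, str(sum(values))] + [str(v) for v in values]))
    return "\n".join(lines) + "\n"


# Hypothetical example tables (sample and sequence names are invented).
dna_text = (
    "Representative_Sequence\ttotal\tS1_DNA\tS2_DNA\n"
    "seqA\t5\t3\t2\n"
    "seqB\t1\t0\t1\n"
)
cdna_text = (
    "Representative_Sequence\ttotal\tS1_cDNA\n"
    "seqA\t4\t4\n"
    "seqC\t2\t2\n"
)

merged_samples, merged_counts = merge_count_tables(
    read_count_table(dna_text), read_count_table(cdna_text)
)
merged_text = format_count_table(merged_samples, merged_counts)
```

Here seqA, which appears in both tables, ends up as a single row with counts in all three sample columns, while seqB and seqC get zeros for the samples they were not seen in.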

dwaite · August 10, 2017, 10:48pm

The newer versions of mothur have a command to merge count tables together (here).

mniku · August 18, 2017, 8:03am

Great, thanks!

mniku · August 18, 2017, 12:40pm

Unfortunately, this doesn’t seem to work as I thought. It can’t REALLY merge count tables when they contain identical sequences. I mean, I have count tables from different samples with partially identical sequences, and I’m trying to join these for further processing.

The same problem arises when joining the fasta files, although this case is simpler: is there any tool to join fasta files while removing duplicates?
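
To illustrate what I mean by joining while removing duplicates, here is a rough Python sketch. It assumes duplicates are defined as exactly identical sequence strings (same case, same alignment gaps) and that the first name seen is kept as the representative; the function names and example records are hypothetical. The name map it returns is the piece that would be needed to relabel count table rows before adding them up.

```python
def parse_fasta(text):
    """Yield (name, sequence) pairs from FASTA text."""
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Keep only the first word of the header as the sequence name.
            name, chunks = line[1:].split()[0], []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def merge_unique(*fasta_texts):
    """Collapse identical sequences across several FASTA inputs.

    Returns two dicts:
      rep_for_seq:  sequence string -> representative name (first one seen)
      rep_for_name: every original name -> its representative name
    """
    rep_for_seq = {}
    rep_for_name = {}
    for text in fasta_texts:
        for name, seq in parse_fasta(text):
            rep = rep_for_seq.setdefault(seq, name)
            rep_for_name[name] = rep
    return rep_for_seq, rep_for_name


def format_fasta(rep_for_seq):
    """Write one record per unique sequence."""
    return "".join(f">{name}\n{seq}\n" for seq, name in rep_for_seq.items())


# Hypothetical example records (names and sequences are invented).
dna_fasta = ">d1\nACGT\n>d2\nGGCC\n"
cdna_fasta = ">c1\nACGT\n>c2\nTTAA\n"

rep_for_seq, rep_for_name = merge_unique(dna_fasta, cdna_fasta)
unique_fasta = format_fasta(rep_for_seq)
```

In this example c1 has the same sequence as d1, so it is dropped from the joined fasta and mapped to d1 in the name map, while d2 and c2 stay as they are.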
