Clustering on tagmented fragments

PS13 · January 27, 2021, 4:09pm

I’m trying to re-analyse some open-access data. The authors used V1-V4 primers (I know, this isn’t the best approach at all!) and the DNA was tagmented before sequencing. This means that the sequence lengths and the start and end positions are highly variable after alignment. I wanted to run cluster with the vsearch algorithm on the fasta and count files, but was wondering whether that might artificially inflate the number of OTUs on tagmented DNA?

If so, are there any good OTU-based alternative approaches to this analysis I could use, or will only something phylotype-based be suitable?

pschloss · January 27, 2021, 7:15pm

Hi,

That sounds like a mess. The methods in mothur really work best when the reads start and end at the same coordinates. Perhaps they were trying to do something like EMIRGE ("EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data", Genome Biology)? I’d check out the tools that have been built around that approach and see if they help.

Pat
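
To make the point about start and end coordinates concrete, here is a minimal Python sketch that reports where each aligned read begins and ends, and whether any window of alignment columns is covered by all reads. It is an illustration on toy data, not mothur's implementation: the function names (`aligned_span`, `shared_window`) and the read names are hypothetical, and it assumes that `.` and `-` are the only gap characters in the alignment.

```python
# Gap characters assumed for this sketch: '.' for padding, '-' for alignment gaps.
GAP_CHARS = {".", "-"}


def aligned_span(aligned_seq):
    """Return (start, end) as 1-based alignment columns of the first and
    last base in an aligned sequence, or None if it contains no bases."""
    positions = [i for i, ch in enumerate(aligned_seq, start=1)
                 if ch not in GAP_CHARS]
    if not positions:
        return None
    return positions[0], positions[-1]


def shared_window(spans):
    """Return the (start, end) column window covered by every read,
    or None if the reads have no common window."""
    start = max(s for s, _ in spans.values())
    end = min(e for _, e in spans.values())
    return (start, end) if start <= end else None


# Toy alignment: three hypothetical reads that start and end at different columns.
toy_alignment = {
    "read_a": "ACGTACGT....",
    "read_b": "....ACGTACGT",
    "read_c": "..GTACGTAC..",
}

spans = {name: aligned_span(seq) for name, seq in toy_alignment.items()}
for name, (start, end) in spans.items():
    print(f"{name}\tstart={start}\tend={end}")

print("distinct (start, end) pairs:", len(set(spans.values())))
print("window shared by all reads:", shared_window(spans))
```

With tagmented reads, the number of distinct start and end pairs is typically large and the shared window may be short or missing entirely, which is the situation described above. On a real alignment, mothur's summary.seqs command reports start and end positions, so a script like this is only useful for seeing the idea.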
