Help with contigs

allan_santos · July 28, 2023, 6:29pm

Dear all,
I’ve sequenced the V3-V4 region with an Illumina v2 kit (2x150, paired-end); the original amplicon is around 630 bp. In theory I cannot build contigs from overlapping reads, right? The reads are much shorter than the original amplicon.
How should I proceed in mothur after running “make.file” to list R1 and R2 for each sample?

Initially, I ran make.contigs, which produced the following output:

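| | Start | End | NBases | Ambigs | Polymer | NumSeqs |
|---|---|---|---|---|---|---|
| Minimum | 1 | 35 | 35 | 0 | 2 | 1 |
| 2.5%-tile | 1 | 157 | 157 | 0 | 4 | 199596 |
| 25%-tile | 1 | 249 | 249 | 0 | 4 | 1995954 |
| Median | 1 | 288 | 288 | 2 | 5 | 3991907 |
| 75%-tile | 1 | 296 | 296 | 7 | 6 | 5987860 |
| 97.5%-tile | 1 | 300 | 300 | 28 | 7 | 7784217 |
| Maximum | 1 | 302 | 302 | 55 | 150 | 7983812 |
| Mean | 1 | 264 | 264 | 5 | 4 | |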

# of unique seqs: 7983812

total # of seqs: 7983812

Can I still use this result and go on with the analysis?
Thank you so much in advance.

pschloss · July 31, 2023, 5:55pm

I’d suggest only using the first read and running it through the phylotype-based pipeline. With no overlap between the reads, things like alignment won’t make sense. Can you regenerate the data using the V4 region with 2x250 nt reads?

Pat
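The arithmetic behind "no overlap" can be sketched in a few lines of Python. This is only an illustration, not part of mothur: the function name `expected_overlap` is made up here, and the calculation ignores primers and quality trimming.

```python
def expected_overlap(read_length: int, amplicon_length: int) -> int:
    """Expected overlap (nt) between the two paired reads.

    A negative value means the reads do not meet and leave a gap
    of that many nucleotides in the middle of the amplicon.
    """
    return 2 * read_length - amplicon_length


# Values taken from the thread: 2x150 reads on a ~630 bp amplicon.
print(expected_overlap(150, 630))  # -330
```

A result of -330 means roughly 330 nt in the middle of the amplicon are never sequenced, so any contig that make.contigs reports for these reads does not come from a true overlap.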

allan_santos · July 31, 2023, 11:17pm

Hi Pat,
thank you for your reply.
Where can I find this phylotype-based pipeline to have a look?

Unfortunately, due to a lack of budget we cannot obtain new data for the V4 region with another sequencing kit, so I’m quite worried about using these data. They are the only information we have.
Thanks once again

pschloss · August 1, 2023, 7:15pm

You would need to adapt the phylotype-based approach found in the MiSeq SOP…

Pat

allan_santos · August 1, 2023, 7:38pm

In the MiSeq SOP I found the following section:
## Phylotype-based analysis
Phylotype-based analysis is the same as OTU-based analysis, but at a different taxonomic scale. We will leave you on your own to replicate the OTU-based analyses described above with the phylotype data

So it seems we could use the same steps described previously for the OTU analysis, but at a different taxonomic scale. Sorry for my lack of knowledge, but how do I do this? Since I can’t overlap the reads with “make.contigs”, should I take the R1 reads straight to “screen.seqs” to remove homopolymers, ambiguous reads, etc.?

Thanks once again

pschloss · August 3, 2023, 7:10pm

I’d suggest taking R1 and doing something like using screen.seqs/chop.seqs to trim the sequences to a common length (perhaps 200 nt) in place of make.contigs and then running them through the rest of the pipeline.
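To make the "trim to a common length" idea concrete, here is a minimal Python sketch. It is not the mothur screen.seqs/chop.seqs implementation; the function name `trim_to_common_length` and the toy reads are hypothetical. Note that the target length is a parameter and cannot exceed the actual read length.

```python
from typing import Iterable, Iterator, Tuple


def trim_to_common_length(
    records: Iterable[Tuple[str, str]], length: int
) -> Iterator[Tuple[str, str]]:
    """Yield (name, sequence) pairs cut to exactly `length` nt.

    Reads shorter than `length` are dropped, so every read that
    survives has the same length.
    """
    for name, seq in records:
        if len(seq) >= length:
            yield name, seq[:length]


# Toy R1 reads (hypothetical data); real reads would come from the R1 files.
reads = [("read1", "ACGTACGTAC"), ("read2", "ACGT"), ("read3", "TTGACCGTA")]
for name, seq in trim_to_common_length(reads, 8):
    print(name, seq)
```

Dropping the short reads instead of padding them keeps every remaining sequence covering the same stretch of the gene, which is the point of trimming to a common length.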

allan_santos · August 8, 2023, 1:21pm

Thank you very much Pat!!

allan_santos · August 9, 2023, 10:53am

Hi Pat,
me again.

In this case, considering only R1 from a 2x150 bp sequencing run of the V3-V4 region, would you recommend running classify.seqs on OTUs (97%) or ASVs, and why?
Thank you so much

pschloss · August 10, 2023, 6:08pm

I would recommend classifying your sequences using classify.seqs and then pooling things with the same family or genus. The data will be too low quality to trust them as 97% OTUs or ASVs.
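Pooling by genus or family amounts to counting sequences that share the same name at a given rank. The Python sketch below assumes taxonomy strings of the form `Bacteria(100);Firmicutes(100);...;` with a confidence score in parentheses after each level; that layout, the function names, and the example assignments are assumptions for illustration, not the output of this dataset.

```python
import re
from collections import Counter
from typing import Dict


def taxon_at_rank(taxonomy: str, rank_index: int) -> str:
    """Return the name at `rank_index` (0 = kingdom), without its confidence score."""
    levels = [
        re.sub(r"\(\d+\)$", "", level)
        for level in taxonomy.strip().strip(";").split(";")
    ]
    return levels[rank_index] if rank_index < len(levels) else "unclassified"


def pool_by_rank(assignments: Dict[str, str], rank_index: int) -> Counter:
    """Count how many sequences fall into each taxon at the chosen rank."""
    return Counter(taxon_at_rank(tax, rank_index) for tax in assignments.values())


# Hypothetical assignments; index 5 is the genus level in a six-level string.
assignments = {
    "seq1": "Bacteria(100);Firmicutes(100);Bacilli(100);Lactobacillales(100);Lactobacillaceae(100);Lactobacillus(98);",
    "seq2": "Bacteria(100);Firmicutes(100);Bacilli(100);Lactobacillales(100);Lactobacillaceae(100);Lactobacillus(91);",
    "seq3": "Bacteria(100);Bacteroidetes(100);Bacteroidia(100);Bacteroidales(100);Bacteroidaceae(100);Bacteroides(99);",
}
print(pool_by_rank(assignments, 5))
```

Using `rank_index=4` instead would pool at the family level under the same assumed layout.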

allan_santos · August 11, 2023, 12:06am

Hey Pat,
thanks for that.
So I will use only the “wang.tax.summary” file produced by classify.seqs and not proceed to the next steps of the SOP? The samples will have different numbers of sequences. Can I subsample them for comparisons?
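The question above went unanswered in the thread. For readers wondering what subsampling to an equal depth means in practice, here is a hedged Python sketch: it draws the same number of sequences without replacement from each sample's taxon counts. The function name `subsample_counts` and the counts are hypothetical, and this is not a description of how mothur does it.

```python
import random
from collections import Counter
from typing import Dict


def subsample_counts(counts: Dict[str, int], depth: int, seed: int = 0) -> Counter:
    """Randomly draw `depth` sequences without replacement from taxon counts."""
    pool = [taxon for taxon, n in counts.items() for _ in range(n)]
    if depth > len(pool):
        raise ValueError("depth exceeds the number of sequences in the sample")
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    return Counter(rng.sample(pool, depth))


# Hypothetical genus-level counts for two samples of unequal size.
samples = {
    "sampleA": {"Lactobacillus": 60, "Bacteroides": 40},
    "sampleB": {"Lactobacillus": 10, "Bacteroides": 30},
}
# Subsample every sample down to the size of the smallest one.
depth = min(sum(c.values()) for c in samples.values())
for name, counts in samples.items():
    print(name, dict(subsample_counts(counts, depth)))
```

Choosing the smallest sample's total as the depth keeps every sample in the comparison; a larger depth would require discarding the samples that fall below it.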

system · August 21, 2023, 12:06am

This topic was automatically closed 10 days after the last reply. New replies are no longer allowed.
