Hi all,
I am using the sub.sample() command to subsample a subset of MiSeq sequeces (which is always too big to process for OTU analysis, I have 1043161 unique reads after denoising and chimera check, I only want around 500-1500 unique reads).
My command looks like this:
sub.sample(fasta=all.otu.unique.fasta, name=all.otu.names, group=all.otu.groups, taxonomy=all.otu.rdp.taxonomy.80,size=500)
Where the fasta file contains all unique reads, name file contains the names of reads that are the same, group file contains which sample each reads belongs to, and taxonomy file contains the taxonomy assignment for each reads (not unique reads).
After I run this command, I got outputs and error message:
Sampling 500 from 13879970.
Deconvoluting subsampled fasta file...
/******************************************/
Running command: unique.seqs(fasta=all.otu.unique.subsample.fasta)
500 331
Output File Names:
all.otu.unique.subsample.names
all.otu.unique.subsample.unique.fasta
/******************************************/
Done.
[ERROR]: M01246_29_000000000-A461D_1_1101_13732_3258 is missing, please correct.
[ERROR]: M01246_29_000000000-A461D_1_1101_14913_3491 is missing, please correct.
[ERROR]: M01246_29_000000000-A461D_1_1101_20287_3764 is missing, please correct.
This is not a complete list of error message.
I wasn’t sure what it is talking about saying something is missing, so I checked using the very first read.
It is in the original name file, also in the group file, and taxonomy file. But it is not in the fasta file, since it is a repeated read. So how it is missing, if the read name is in the input file I provided? I am wondering whether I am doing something wrong? Should I use the not-unique fasta in this command?
However, I did get output files that seem OK:
Output File Names:
all.otu.subsample.names
all.otu.rdp.taxonomy.subsample.80
all.otu.unique.subsample.fasta
all.otu.subsample.groups
But this error message bothers me. I am afraid this would have some affect on the subsampled reads. Is anyone having the same error message? What is the affect if I am just using the output files here, ignoring the error message? I am welcome to any suggestions.
PS, I tried with different versions of mothur,they all give the same error message.
Thank you,
Eddi