I want to generate a biom formated file to use in PICRUST.
I tried to generate subsampled shared file with sub.sample(shared=sample.an.0.03.abund.pick.shared) commend and put it as an input file to generate biom.
I used,
make.biom(shared=sample.an.0.03.abund.pick.shared, label=0.03, reftaxonomy=gg_5_99.gg.tax, constaxonomy=sample.an.0.03.abund.pick.0.03.cons.taxonomy, picrust=97_otu_map.txt)
It generated a biom file and a biom_shared file, but when I checked the biom_shared file, the number of total sequences were different from the subsampled shared file.
I think the reason is because I used constaxonomy file which was generated based on non-subsampled file.
If I generate a biom file with non-subsampled shared file, is it possible to subsample the biom file?
Or is there any other way to generate constaxonomy file with shared file? (because only the biom_shared file has the information of greengene ID)
As far as i know, you don’t need to subsample your shared file to make .biom for picrust. The normalization can be done with picrust i.e. normalize_by_copy_number.py. Not sure but please correct me if i am wrong.
Ousama
(ps: the same for .biom to use with LEfSe, as the normalization can be done with LEfSE i.e. divind by total and x 1000000
Hi, I have read contrasting comments about inputing a rarefied otu table into picrust. Can anybody comment on this?
If so, is it possible to make a subsample constaxonomy file? I was trying with: