qiime2的物种注释研究
-
https://microbiomejournal.biomedcentral.com/articles/10.1186/s40168-018-0470-z
Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin
qiime2的分类器使用的是q2-feature-classifier插件 这个插件包括了如下几个功能:
classify-consensus-blast BLAST+ consensus taxonomy classifier
classify-consensus-vsearch VSEARCH-based consensus taxonomy classifier
classify-hybrid-vsearch-sklearn ALPHA Hybrid classifier: VSEARCH exact match + sklearn classifierclassify-sklearn Pre-fitted sklearn-based taxonomy classifier
extract-reads Extract reads from reference sequences.
fit-classifier-naive-bayes Train the naive_bayes classifier
fit-classifier-sklearn Train an almost arbitrary scikit-learn classifier这些功能主要分为三类:
1.基于机器学习的machine-learning
2.基于对齐的alignment-based
从代码看qiime的对齐方案用了blast+lca 和我们的病原的方案一样
blast后面看看怎么优化
3.构建相关的数据库 -
eDNA需要部署的流程和数据库:
1.denoise+blast物种注释
2.denoise+bayes物种注释
3.nextflow 全流程
3.denoise+rdp物种注释(这个不需要 classify-sklearn和RDP类似)
4.训练分类数据库:makeblastdb、qiime2_train
5.BOLD数据库、NT数据库、mitofish、silva、unite
6.qiime2的全流程
7.交互式分析工具、blast server
8.nextflow hifi 全长