<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[物种注释]]></title><description><![CDATA[<ul>
<li>
<p dir="auto">kmer-based read classifiers<br />
Kraken　Kraken ２, Opal　 CLARK, and MetaOthello</p>
</li>
<li>
<p dir="auto">alignment-based methods, for complete genomes or signature or marker genes<br />
Megan　MetaPhlan　 GASiC　and MG-RAST</p>
</li>
<li>
<p dir="auto">Bayesian or EM-based estimators.<br />
Bracken, MetaKallisto, and Pathoscope</p>
</li>
</ul>
<p dir="auto"><a href="https://www.nature.com/articles/s41467-019-10934-2" rel="nofollow ugc">https://www.nature.com/articles/s41467-019-10934-2</a><br />
Strain-level metagenomic assignment and compositional estimation for long reads with MetaMaps</p>
]]></description><link>http://an.forum.genostack.com/topic/122/物种注释</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 12:35:20 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/122.rss" rel="self" type="application/rss+xml"/><pubDate>Fri, 27 Nov 2020 04:05:16 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 物种注释 on Mon, 18 Apr 2022 12:11:41 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://academic.oup.com/bib/article/19/3/495/2733162" rel="nofollow ugc">https://academic.oup.com/bib/article/19/3/495/2733162</a><br />
Comparing genome versus proteome-based identification of clinical bacterial isolates<br />
<img src="/assets/uploads/files/1650283898709-55e6d3da-3479-42f9-958d-002a12de02df-image.png" alt="55e6d3da-3479-42f9-958d-002a12de02df-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/1390</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1390</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 18 Apr 2022 12:11:41 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Mon, 18 Apr 2022 11:09:22 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.future-science.com/doi/10.2144/000114135" rel="nofollow ugc">https://www.future-science.com/doi/10.2144/000114135</a><br />
TUIT, a BLAST-based tool for taxonomic classification of nucleotide sequences<br />
这个里面提到 16S是人体微生物的进标准 物种注释大部分用RDP数据库分类 但是RDP少了NT中的很多参考序列 因此可以用TUIT做进一步的补充分类。TUIT也可以适用于其他场景。<br />
TUIT 不限于任何特定类型的序列，并且对短至 125 个碱基对的序列保持高特异性水平；它还具有将序列分类到物种级别的能力。<br />
<img src="/assets/uploads/files/1650279501305-f382dca4-d373-4f4e-a294-d1e287930f1a-image.png" alt="f382dca4-d373-4f4e-a294-d1e287930f1a-image.png" class=" img-responsive img-markdown" /><br />
<img src="/assets/uploads/files/1650280160343-b422e648-3f1a-4b30-9120-52ae4a6f6ea6-image.png" alt="b422e648-3f1a-4b30-9120-52ae4a6f6ea6-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/1389</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1389</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 18 Apr 2022 11:09:22 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Wed, 23 Mar 2022 08:15:28 GMT]]></title><description><![CDATA[<p dir="auto"><a href="/assets/uploads/files/1648023327597-current-protocols-2021-ba-c-diamond-megan-fast-and-easy-taxonomic-and-functional-analysis-of-short-and-long.pdf">Current Protocols - 2021 - Ba c - DIAMOND MEGAN  Fast and Easy Taxonomic and Functional Analysis of Short and Long.pdf</a></p>
]]></description><link>http://an.forum.genostack.com/post/1300</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1300</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 23 Mar 2022 08:15:28 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Sun, 20 Mar 2022 16:05:07 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/labs/pmc/articles/PMC5978398/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/labs/pmc/articles/PMC5978398/</a><br />
BLAST-based validation of metagenomic sequence assignments<br />
用blast对宏基因的序列物种鉴定（例如kraken）进行校正</p>
]]></description><link>http://an.forum.genostack.com/post/1292</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1292</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Sun, 20 Mar 2022 16:05:07 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Tue, 01 Dec 2020 11:53:03 GMT]]></title><description><![CDATA[<p dir="auto"><a href="http://ccb.jhu.edu/software/choosing-a-metagenomics-classifier/" rel="nofollow ugc">http://ccb.jhu.edu/software/choosing-a-metagenomics-classifier/</a></p>
]]></description><link>http://an.forum.genostack.com/post/203</link><guid isPermaLink="true">http://an.forum.genostack.com/post/203</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 01 Dec 2020 11:53:03 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Fri, 27 Nov 2020 08:00:13 GMT]]></title><description><![CDATA[<p dir="auto"><a href="http://currents.plos.org/treeoflife/index.html%3Fp=395.html" rel="nofollow ugc">http://currents.plos.org/treeoflife/index.html%3Fp=395.html</a><br />
将NCBI和wiki联系起来</p>
]]></description><link>http://an.forum.genostack.com/post/189</link><guid isPermaLink="true">http://an.forum.genostack.com/post/189</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 27 Nov 2020 08:00:13 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Fri, 27 Nov 2020 07:38:55 GMT]]></title><description><![CDATA[<p dir="auto">NCBI taxonomy databases<br />
<a href="https://www.uppmax.uu.se/resurser/databases/ncbi-taxonomy-databases/" rel="nofollow ugc">https://www.uppmax.uu.se/resurser/databases/ncbi-taxonomy-databases/</a><br />
Name	Source	Notes<br />
taxdump	NCBI	NCBI taxonomic database, in multiple .dmp files (see taxdump_readme.txt or link)<br />
taxcat	NCBI	NCBI taxonomic categories, in categories.dmp (see taxcat_readme.txt or link)<br />
taxdump_readme.txt	NCBI	NCBI taxdump file description<br />
taxcat_readme.txt	NCBI	NCBI taxcat file description<br />
gi_taxid_nucl.dmp	NCBI	Mappings of nucleotide GI to taxid (DEPRECATED)<br />
gi_taxid_prot.dmp	NCBI	Mappings of protein GI to taxid (DEPRECATED)<br />
nucl_wgs.accession2taxid	NCBI	TaxID mapping for nucleotide records of type WGS or TSA<br />
nucl_gb.accession2taxid	NCBI	TaxID mapping for nucleotide records not of the above types<br />
prot.accession2taxid	NCBI	TaxID mapping for protein records<br />
pdb.accession2taxid	NCBI	TaxID mapping for PDB protein records<br />
dead_nucl.accession2taxid	NCBI	TaxID mapping for dead nucleotide records<br />
dead_prot.accession2taxid	NCBI	TaxID mapping for dead protein records<br />
dead_wgs.accession2taxid	NCBI	TaxID mapping for dead WGS or TSA records</p>
]]></description><link>http://an.forum.genostack.com/post/188</link><guid isPermaLink="true">http://an.forum.genostack.com/post/188</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 27 Nov 2020 07:38:55 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Fri, 27 Nov 2020 06:31:58 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://jshleap.github.io/bioinformatics/writting-jMEGAN_notes/" rel="nofollow ugc">https://jshleap.github.io/bioinformatics/writting-jMEGAN_notes/</a><br />
Introduction to MEGAN methods</p>
]]></description><link>http://an.forum.genostack.com/post/187</link><guid isPermaLink="true">http://an.forum.genostack.com/post/187</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 27 Nov 2020 06:31:58 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Fri, 27 Nov 2020 04:32:15 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://webcache.googleusercontent.com/search?q=cache:fJfbDasHie8J:https://nickilottmetagenomics.wordpress.com/2015/01/19/comparing-diamond-lca-against-kraken/+&amp;cd=10&amp;hl=en&amp;ct=clnk&amp;gl=us" rel="nofollow ugc">https://webcache.googleusercontent.com/search?q=cache:fJfbDasHie8J:https://nickilottmetagenomics.wordpress.com/2015/01/19/comparing-diamond-lca-against-kraken/+&amp;cd=10&amp;hl=en&amp;ct=clnk&amp;gl=us</a><br />
DIAMOND + LCA outperforms Kraken in the sensitivity stakes by ~10 fold when assigning reads from mouse gut to genera. Accuracy of these assignments is not assessed, however.<br />
DIAMOND+LCA</p>
<p dir="auto">DIAMOND (version 0.3.9) was run using the following command:</p>
<p dir="auto">diamond blastx –db /ifs/mirror/diamond/nr –query &lt;input.fastq&gt; -v 2 –threads 16 -o &lt;diamond.output.tsv&gt;</p>
<p dir="auto">LCA mapper (from mtools, MEGAN5) was run using:</p>
<p dir="auto"><a href="http://lcamapper.sh" rel="nofollow ugc">lcamapper.sh</a> -i &lt;diamond.output.tsv&gt; -f Detect -ms 50 -me 0.01 -tp 50 -gt megan/gi_taxid_prot.bin -o &lt;lca.output&gt;</p>
]]></description><link>http://an.forum.genostack.com/post/186</link><guid isPermaLink="true">http://an.forum.genostack.com/post/186</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 27 Nov 2020 04:32:15 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Fri, 27 Nov 2020 06:03:47 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://besjournals.onlinelibrary.wiley.com/doi/full/10.1111/2041-210X.13095" rel="nofollow ugc">https://besjournals.onlinelibrary.wiley.com/doi/full/10.1111/2041-210X.13095</a><br />
Identification of the taxonomic origin of a DNA sequence is crucial for many sequencing projects, e.g. metagenomics studies, identification of contaminations in whole genome sequencing projects and filtering of organisms of interest in marker‐gene based community analyses.<br />
Last common ancestor algorithms are powerful approaches to estimate the taxonomy of a given sequence and have been widely used for classification of next‐generation sequencing (NGS) reads, also known as 2nd generation sequencing reads.<br />
Here, we present BASTA (<a href="https://github.com/timkahlke/BASTA" rel="nofollow ugc">https://github.com/timkahlke/BASTA</a>), a basic sequence taxonomy annotator, which extends last common ancestor estimations from sequencing reads to any kind of nucleotide or amino acid sequence utilizing NCBI taxonomies of user‐defined best hits.<br />
BASTA can be configured to use the output of many common sequence comparison tools, e.g. BLAST and Diamond, in conjunction with either provided or user‐defined target sequence databases.<br />
<img src="/assets/uploads/files/1606457024144-6253df26-2c42-4a10-bfb1-c34e1f1e1da2-image-resized.png" alt="6253df26-2c42-4a10-bfb1-c34e1f1e1da2-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/185</link><guid isPermaLink="true">http://an.forum.genostack.com/post/185</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 27 Nov 2020 06:03:47 GMT</pubDate></item><item><title><![CDATA[Reply to 物种注释 on Fri, 27 Nov 2020 04:10:07 GMT]]></title><description><![CDATA[<p dir="auto"><a href="http://ccb.jhu.edu/software/choosing-a-metagenomics-classifier/" rel="nofollow ugc">http://ccb.jhu.edu/software/choosing-a-metagenomics-classifier/</a><br />
How to Choose Your Metagenomics Classification Tool</p>
]]></description><link>http://an.forum.genostack.com/post/184</link><guid isPermaLink="true">http://an.forum.genostack.com/post/184</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 27 Nov 2020 04:10:07 GMT</pubDate></item></channel></rss>