<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[组装 assembly]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5503144/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5503144/</a><br />
Phased Diploid Genome Assembly with Single Molecule Real-Time Sequencing<br />
most assemblers output a “mosaic” genome sequence that arbitrarily alternates between parental alleles<br />
大多数组装软件随意拼接了等位基因。<br />
为什么人是2倍体，但是参考基因组只有一组？<br />
<a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4305238/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4305238/</a><br />
Extending reference assembly models<br />
Employing a clone-based approach, the sequence of each clone represented a single haplotype from a given donor. At clone boundaries, however, haplotypes could switch abruptly, creating a mosaic structure. This design introduced errors within regions of complex structural variation, when sequences unique to one haplotype prevented construction of clone overlaps.<br />
Many alignment and analysis tools penalize reads that align to more than one location under the assumption that the location of these reads cannot be resolved owing to paralogous sequences in the genome. These tools do not distinguish allelic duplication, added by the alternative loci, from paralogous duplication found in the genome, thus confounding repeat and mappability calculations, paired-end placements and downstream interpretation of alignments in regions with alternative loci.<br />
软件在比对时 无法区分reads到底比对到了哪个同源序列上</p>
<p dir="auto"><a href="https://en.wikipedia.org/wiki/Reference_genome#:~:text=As%20they%20are%20assembled%20from,DNA%20sequences%20from%20each%20donor" rel="nofollow ugc">https://en.wikipedia.org/wiki/Reference_genome#:~:text=As they are assembled from,DNA sequences from each donor</a>.<br />
As they are assembled from the sequencing of DNA from a number of individual donors, reference genomes do not accurately represent the set of genes of any single individual organism. Instead a reference provides a haploid mosaic of different DNA sequences from each donor.</p>
<p dir="auto"><a href="https://www.bio-itworld.com/news/2014/06/30/the-hunt-for-a-new-human-reference-genome" rel="nofollow ugc">https://www.bio-itworld.com/news/2014/06/30/the-hunt-for-a-new-human-reference-genome</a></p>
]]></description><link>http://an.forum.genostack.com/topic/709/组装-assembly</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 12:13:25 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/709.rss" rel="self" type="application/rss+xml"/><pubDate>Tue, 28 Jun 2022 11:31:43 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 组装 assembly on Wed, 13 Jul 2022 02:29:51 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.melbournebioinformatics.org.au/tutorials/tutorials/hybrid_assembly/nanopore_assembly/" rel="nofollow ugc">https://www.melbournebioinformatics.org.au/tutorials/tutorials/hybrid_assembly/nanopore_assembly/</a><br />
从这个文章看 quast可以使用参考基因组进行质控  busco可以使用同系物进行质控</p>
]]></description><link>http://an.forum.genostack.com/post/1674</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1674</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 13 Jul 2022 02:29:51 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Fri, 01 Jul 2022 10:35:10 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.frontiersin.org/articles/10.3389/fpls.2018.01660/full" rel="nofollow ugc">https://www.frontiersin.org/articles/10.3389/fpls.2018.01660/full</a><br />
Current Strategies of Polyploid Plant Genome Sequence Assembly<br />
<img src="/assets/uploads/files/1656668044433-b3073fe2-369b-4a0a-9424-02f4f2da6307-image.png" alt="b3073fe2-369b-4a0a-9424-02f4f2da6307-image.png" class=" img-responsive img-markdown" /><br />
<img src="/assets/uploads/files/1656668062357-9669a283-455a-42ff-9485-07a0e92593f7-image.png" alt="9669a283-455a-42ff-9485-07a0e92593f7-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto">A reference genome is a digital, linear nucleic acid sequence containing only a single set of chromosomes plus any unanchored heterozygous contigs and/or scaffolds. A reference genome is used to observe variations across different individuals within a species, to study evolution and to aid genome assembly.</p>
]]></description><link>http://an.forum.genostack.com/post/1640</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1640</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 01 Jul 2022 10:35:10 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Fri, 01 Jul 2022 09:33:01 GMT]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1656667847847-35053ca0-3d16-4ce8-9a3e-24c0769de794-image.png" alt="35053ca0-3d16-4ce8-9a3e-24c0769de794-image.png" class=" img-responsive img-markdown" /><br />
<a href="https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-017-1911-6" rel="nofollow ugc">https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-017-1911-6</a><br />
Reference-guided de novo assembly approach improves genome reconstruction for related species</p>
]]></description><link>http://an.forum.genostack.com/post/1639</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1639</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 01 Jul 2022 09:33:01 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Fri, 01 Jul 2022 09:28:19 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://academic.oup.com/nar/article/44/12/e113/2457531" rel="nofollow ugc">https://academic.oup.com/nar/article/44/12/e113/2457531</a><br />
Redundans: an assembly pipeline for highly heterozygous genomes<br />
<img src="/assets/uploads/files/1656667697853-3aa8c9e4-7c9a-40a2-a5e4-1eedc6fe5858-image.png" alt="3aa8c9e4-7c9a-40a2-a5e4-1eedc6fe5858-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/1638</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1638</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 01 Jul 2022 09:28:19 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Fri, 01 Jul 2022 09:06:58 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-022-04591-4" rel="nofollow ugc">https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-022-04591-4</a><br />
gcaPDA: a haplotype-resolved diploid assembler<br />
<img src="/assets/uploads/files/1656666417000-92942982-0499-44cf-a023-3d7599a9c336-image.png" alt="92942982-0499-44cf-a023-3d7599a9c336-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/1637</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1637</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 01 Jul 2022 09:06:58 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Fri, 01 Jul 2022 08:53:55 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.melbournebioinformatics.org.au/tutorials/tutorials/assembly/assembly-protocol/" rel="nofollow ugc">https://www.melbournebioinformatics.org.au/tutorials/tutorials/assembly/assembly-protocol/</a></p>
]]></description><link>http://an.forum.genostack.com/post/1636</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1636</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 01 Jul 2022 08:53:55 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Fri, 01 Jul 2022 08:44:58 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.melbournebioinformatics.org.au/tutorials/tutorials/assembly/assembly-background/#de-novo-assembly-with-velvet-and-the-velvet-optimiser" rel="nofollow ugc">https://www.melbournebioinformatics.org.au/tutorials/tutorials/assembly/assembly-background/#de-novo-assembly-with-velvet-and-the-velvet-optimiser</a><br />
<img src="/assets/uploads/files/1656665097277-3b8dccca-b66e-4633-b7e3-4f9c620e91ae-image.png" alt="3b8dccca-b66e-4633-b7e3-4f9c620e91ae-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/1635</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1635</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 01 Jul 2022 08:44:58 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Fri, 01 Jul 2022 03:14:24 GMT]]></title><description><![CDATA[<p dir="auto">Synteny provides a framework in which conservation of homologous genes and gene order is identified between genomes of different species.<br />
Inferring synteny between genome assemblies: a systematic evaluation</p>
<p dir="auto"><a href="https://www.nature.com/scitable/topicpage/synteny-inferring-ancestral-genomes-44022/" rel="nofollow ugc">https://www.nature.com/scitable/topicpage/synteny-inferring-ancestral-genomes-44022/</a><br />
<img src="/assets/uploads/files/1656645239038-e6066e1e-7016-4073-8ec4-0ac1b443955b-image.png" alt="e6066e1e-7016-4073-8ec4-0ac1b443955b-image.png" class=" img-responsive img-markdown" /><br />
<img src="/assets/uploads/files/1656645246238-4360e495-f9ff-45c5-94bf-5020f166e708-image.png" alt="4360e495-f9ff-45c5-94bf-5020f166e708-image.png" class=" img-responsive img-markdown" /><br />
对synteny的解释</p>
]]></description><link>http://an.forum.genostack.com/post/1631</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1631</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 01 Jul 2022 03:14:24 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Fri, 01 Jul 2022 02:46:23 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4083584/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4083584/</a><br />
Genetic anchoring of whole-genome shotgun assemblies</p>
<p dir="auto">The process of assigning chromosomal locations to contigs of an assembly is referred to as anchoring.</p>
]]></description><link>http://an.forum.genostack.com/post/1630</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1630</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 01 Jul 2022 02:46:23 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Tue, 28 Jun 2022 12:47:52 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://gigascience.biomedcentral.com/articles/10.1186/2047-217X-1-18" rel="nofollow ugc">https://gigascience.biomedcentral.com/articles/10.1186/2047-217X-1-18</a><br />
SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler</p>
<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2813482/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2813482/</a><br />
De novo assembly of human genomes with massively parallel short read sequencing<br />
<img src="/assets/uploads/files/1656419986553-79ebf8d4-6491-4e5b-a5b0-a6c41df531db-image.png" alt="79ebf8d4-6491-4e5b-a5b0-a6c41df531db-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto">(iv) merging the bubbles that were caused by repeats or heterozygotes of diploid chromosomes.<br />
soapdenovo（其他软件类似）有个步骤就是合并杂合 所以组装出来的就是单倍体</p>
<p dir="auto">Merging bubbles<br />
We used Dijkstra's algorithm to detect bubbles, which is similar to the “Tour-bus” method in Velvet. We merged the detected bubbles into a single path if the sequences of the parallel paths were very similar; that is, only had a single base pair difference or had fewer than four base pairs difference with &gt;90% identity.</p>
]]></description><link>http://an.forum.genostack.com/post/1610</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1610</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 28 Jun 2022 12:47:52 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Tue, 28 Jun 2022 12:26:39 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022571/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022571/</a><br />
A graph-based approach to diploid genome assembly<br />
However, across the short, long and hybrid categories, most assemblers require collapsing the two genome sequences of a diploid sample into a single haploid ‘consensus’ sequence (or primary contig). The consensus sequence is obtained by merging the distinct alleles at regions of heterozygosity into a single allele, and therefore losing a lot of information. The resulting haploid de novo assembly does not represent the true characteristics of the diploid input genome.</p>
]]></description><link>http://an.forum.genostack.com/post/1609</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1609</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 28 Jun 2022 12:26:39 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Tue, 28 Jun 2022 12:08:33 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1774-4" rel="nofollow ugc">https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1774-4</a><br />
Is it time to change the reference genome?<br />
a consensus genome represents the most common alleles and variants within a population<br />
<img src="/assets/uploads/files/1656418100839-5097c16f-d4d2-420c-8537-4150e1b78970-image.png" alt="5097c16f-d4d2-420c-8537-4150e1b78970-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/1608</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1608</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 28 Jun 2022 12:08:33 GMT</pubDate></item><item><title><![CDATA[Reply to 组装 assembly on Tue, 28 Jun 2022 11:31:57 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3227110/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3227110/</a><br />
Assemblathon 1: A competitive assessment of de novo short read assembly methods</p>
]]></description><link>http://an.forum.genostack.com/post/1607</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1607</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 28 Jun 2022 11:31:57 GMT</pubDate></item></channel></rss>