<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[比对过程涉及的参数]]></title><description><![CDATA[<p dir="auto">Soft-clipping</p>
<p dir="auto">primary alignment<br />
Secondary alignments<br />
<img src="/assets/uploads/files/1658482212237-c5231bce-4030-4b1d-9bbb-2b9177deaeed-image.png" alt="c5231bce-4030-4b1d-9bbb-2b9177deaeed-image.png" class=" img-responsive img-markdown" /><br />
A <strong>secondary alignment</strong> occurs when a given read could align reasonably well to more than one place. One of the possible reported alignments is termed <strong>"primary"</strong> and the others will be marked as "secondary".</p>
<p dir="auto"><strong>Unmapped reads</strong> are in the BAM file but have no valid assigned position (N.B., they may have an assigned position, but it should be ignored). It's typically the case that a number of reads can't be aligned, due to things like sequencing errors, imperfect matches between the DNA sequenced and the reference, random e. coli or other contamination, etc..</p>
<p dir="auto">supplemental alignments.</p>
<p dir="auto"><a href="https://yulijia.net/en/bioinformatics/2015/12/21/Linear-Chimeric-Supplementary-Primary-and-Secondary-Alignments.html#fn:1" rel="nofollow ugc">https://yulijia.net/en/bioinformatics/2015/12/21/Linear-Chimeric-Supplementary-Primary-and-Secondary-Alignments.html#fn:1</a><br />
<a href="https://hongiiv.tistory.com/844" rel="nofollow ugc">https://hongiiv.tistory.com/844</a></p>
]]></description><link>http://an.forum.genostack.com/topic/734/比对过程涉及的参数</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 14:28:27 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/734.rss" rel="self" type="application/rss+xml"/><pubDate>Fri, 22 Jul 2022 09:56:02 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 比对过程涉及的参数 on Mon, 25 Jul 2022 12:19:34 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://gist.github.com/crazyhottommy/ed73c7e2daee8383dccb35f224f99714" rel="nofollow ugc">https://gist.github.com/crazyhottommy/ed73c7e2daee8383dccb35f224f99714</a><br />
获取唯一比对的序列<br />
samtools view -h my.bam | awk '$17 ~ /XA:/' || $1 ~ /^@/' | samtools view -bS - &gt; my_unique.bam</p>
<p dir="auto"><a href="https://bioinformatics.stackexchange.com/questions/508/obtaining-uniquely-mapped-reads-from-bwa-mem-alignment" rel="nofollow ugc">https://bioinformatics.stackexchange.com/questions/508/obtaining-uniquely-mapped-reads-from-bwa-mem-alignment</a></p>
]]></description><link>http://an.forum.genostack.com/post/1752</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1752</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 25 Jul 2022 12:19:34 GMT</pubDate></item><item><title><![CDATA[Reply to 比对过程涉及的参数 on Sat, 23 Jul 2022 09:51:56 GMT]]></title><description><![CDATA[<p dir="auto"><a href="http://evomics.org/2014/01/alignment-methods/" rel="nofollow ugc">http://evomics.org/2014/01/alignment-methods/</a></p>
]]></description><link>http://an.forum.genostack.com/post/1750</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1750</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Sat, 23 Jul 2022 09:51:56 GMT</pubDate></item><item><title><![CDATA[Reply to 比对过程涉及的参数 on Fri, 22 Jul 2022 10:24:36 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.cnblogs.com/timeisbiggestboss/p/8856888.htmlSecondary" rel="nofollow ugc">https://www.cnblogs.com/timeisbiggestboss/p/8856888.htmlSecondary</a> ,Supplementary alignment 和bwa mem的-M -Y参数<br />
1.supplementary alignment</p>
<p dir="auto">supplementary alignment是指一条read的一部分和参考区域1比对成功，另一部分和参考区域2比对成功，参考区域1和参考区域2没有交集（或很少），那么一条read就会产生两条sam文件，</p>
<p dir="auto">将其中的一条sam文件作为represent alignment，而另一条作为supplementary alignment （flag为2048）。</p>
<p dir="auto">将上面的fastq文件去跑bwa，read有两条sam文件，第二条的flag值为2048：</p>
<p dir="auto">2.bwa mem的-M -Y参数：</p>
<p dir="auto">-M：mark shorter split hits as secondary。就是把supplemenary alignment 变为no primary（flag值256） 。下面是bwa mem -M的结果</p>
<p dir="auto">-Y:use soft clipping for supplementary alignments。把默认的hard clip变为soft clip。hard clip 不会显示不匹配的碱基串，soft clip会显示不匹配的碱基串。下面是bwa mem -Y的结果（58H34M变为58S34M）</p>
<p dir="auto">3.secondary(no primary)是指这条read在基因组上有多个匹配区域（&gt;=2），可以是read是的同一部分有不同匹配区域，也可以是一条read上的不同区域。所以supplemenary aligment应该算是secondary的子集。</p>
<p dir="auto">许多处理bam的软件不会去处理supplemenary（split alignments），比如Picard’s markDuplicates，所以可能需要用-M把supplemenary 转换为secondary。</p>
]]></description><link>http://an.forum.genostack.com/post/1721</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1721</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 22 Jul 2022 10:24:36 GMT</pubDate></item><item><title><![CDATA[Reply to 比对过程涉及的参数 on Fri, 22 Jul 2022 10:05:19 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://biohpc.cornell.edu/lab/doc/Variant_workshop_Part1.pdf" rel="nofollow ugc">https://biohpc.cornell.edu/lab/doc/Variant_workshop_Part1.pdf</a></p>
<p dir="auto"><a href="https://gatk.broadinstitute.org/hc/en-us/articles/360039568932--How-to-Map-and-clean-up-short-read-sequence-data-efficiently" rel="nofollow ugc">https://gatk.broadinstitute.org/hc/en-us/articles/360039568932--How-to-Map-and-clean-up-short-read-sequence-data-efficiently</a><br />
<img src="/assets/uploads/files/1658484318347-58a937a2-29ae-475b-958e-c05c308bca83-image.png" alt="58a937a2-29ae-475b-958e-c05c308bca83-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/1720</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1720</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 22 Jul 2022 10:05:19 GMT</pubDate></item></channel></rss>