<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[SAM文件]]></title><description><![CDATA[<p dir="auto">SAM file format<br />
The Sequence Alignment/Map (SAM) is a file format to save alignment information of short reads mapped against reference sequences. It usually starts with a header section followed by alignment information as tab separated lines for each read.</p>
<p dir="auto">Header section<br />
@HD    VN:1.3    SO:coordinate<br />
@SQ    SN:conticA    LN:443<br />
@SQ    SN:contigB    LN:1493<br />
@SQ    SN:contigC    LN:328</p>
<p dir="auto">Tab-delimited read alignment information lines<br />
readID43GYAX15:7:1:1202:19894/1    256    contig43    613960    1    65M    *    0    0    CCAGCGCGAACGAAATCCGCATGCGTCTGGTCGTTGCACGGAACGGCGGCGGTGTGATGCACGGC    EDDEEDEE=EE?DE??DDDBADEBEFFFDBEFFEBCBC=?BEEEE@=:?::?7?:8-6?7?@??#    AS:i:0    XS:i:0  XN:i:0  XM:i:0  XO:i:0  XG:i:0  NM:i:0  MD:Z:65  YT:Z:UU</p>
<p dir="auto">readID43GYAX15:7:1:1202:19894/1    272    contig32    21001    1    65M    *    0    0    GCCGGACGTCACACGGCCGCCGGGCCGGTCTACGACCAGACGCATGCGGATTTCGTTAGAGCCGG    #??@?7?6-8:?7?::?:=@EEEEB?=CBCBEFFEBDFFFEBEDABDDD??ED?EE=EEDEEDDE    AS:i:-5    XS:i:0   XN:i:0  XM:i:1   XO:i:0   XG:i:0   NM:i:1   MD:Z:42T22   YT:Z:UU</p>
<p dir="auto">readID43GYAX15:7:1:1202:19894/1    256    contig87    540849    1    65M    *    0    0    CCTGCACGAACGAAATCCGCATGCGTCTGGTCGTTGTACGGAACGGCGGTTGTGTGACGAACGGC    EDDEEDEE=EE?DE??DDDBADEBEFFFDBEFFEBCBC=?BEEEE@=:?::?7?:8-6?7?@??# AS:i:0  XS:i:0 XN:i:0  XM:i:0 XO:i:0  XG:i:0  NM:i:0    MD:Z:65    YT:Z:UU</p>
<p dir="auto">Meaning of columns</p>
<p dir="auto">QNAME    FLAG    RNAME    POS    MAPQ    CIGAR    RNEXT    PNEXT    TLEN    SEQ    QUAL    TAGS<br />
Read Name<br />
SAM flag --&gt; decode<br />
contig name or * for unmapped<br />
mapped position of base 1 of a read on the reference sequence<br />
MAPQ mapping quality<br />
CIGAR string describing insertions and deletions<br />
Name of mate<br />
Position of mate<br />
Template length<br />
Read Sequence<br />
Read Quality<br />
Additional information in TAG:TYPE:VALUE format</p>
<p dir="auto">一些思考：<br />
SAM作为比对的结果文件，里面也包含了原始序列，这种会很浪费存储空间。fastq加载到数据库后，每一次比对，可以建立一些中间表进行存储，原始序列只保存一份。可以认为比对只是在给原始序列打标签。</p>
]]></description><link>http://an.forum.genostack.com/topic/143/sam文件</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 12:35:37 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/143.rss" rel="self" type="application/rss+xml"/><pubDate>Sat, 19 Dec 2020 08:36:48 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to SAM文件 on Sat, 19 Dec 2020 08:41:34 GMT]]></title><description><![CDATA[<p dir="auto"><a href="/assets/uploads/files/1608367292928-samv1.pdf">SAMv1.pdf</a></p>
]]></description><link>http://an.forum.genostack.com/post/253</link><guid isPermaLink="true">http://an.forum.genostack.com/post/253</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Sat, 19 Dec 2020 08:41:34 GMT</pubDate></item><item><title><![CDATA[Reply to SAM文件 on Sat, 19 Dec 2020 08:37:05 GMT]]></title><description><![CDATA[<p dir="auto"><a href="http://www.metagenomics.wiki/tools/samtools/bam-sam-file-format#:~:text=The%20Sequence%20Alignment%2FMap%20" rel="nofollow ugc">http://www.metagenomics.wiki/tools/samtools/bam-sam-file-format#:~:text=The Sequence Alignment%2FMap </a>(SAM,separated%20lines%20for%20each%20read.</p>
]]></description><link>http://an.forum.genostack.com/post/252</link><guid isPermaLink="true">http://an.forum.genostack.com/post/252</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Sat, 19 Dec 2020 08:37:05 GMT</pubDate></item></channel></rss>