<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[变异分析]]></title><description><![CDATA[<p dir="auto">1.变异的类型<br />
<a href="https://www.garvan.org.au/research/kinghorn-centre-for-clinical-genomics/learn-about-genomics/for-gp/genetics-refresher-1/types-of-variants" rel="nofollow ugc">https://www.garvan.org.au/research/kinghorn-centre-for-clinical-genomics/learn-about-genomics/for-gp/genetics-refresher-1/types-of-variants</a><br />
<img src="/assets/uploads/files/1660183707430-06bc575b-0423-42a6-89db-f7fd887fb1ad-image.png" alt="06bc575b-0423-42a6-89db-f7fd887fb1ad-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto"><img src="/assets/uploads/files/1660183716789-42cadb50-7031-4723-b314-7d4d0a68db0e-image.png" alt="42cadb50-7031-4723-b314-7d4d0a68db0e-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto"><img src="/assets/uploads/files/1660183726184-beefe0ab-d368-431f-b8d9-993673bcd8b4-image.png" alt="beefe0ab-d368-431f-b8d9-993673bcd8b4-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/750/变异分析</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 13:53:29 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/750.rss" rel="self" type="application/rss+xml"/><pubDate>Thu, 11 Aug 2022 03:01:46 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to 变异分析 on Mon, 15 Aug 2022 08:26:07 GMT]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1660551958720-0bc2cc59-a0da-4811-9299-de078db3a44e-image.png" alt="0bc2cc59-a0da-4811-9299-de078db3a44e-image.png" class=" img-responsive img-markdown" /><br />
<a href="https://seqone.com/science/" rel="nofollow ugc">https://seqone.com/science/</a></p>
]]></description><link>http://an.forum.genostack.com/post/1798</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1798</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 15 Aug 2022 08:26:07 GMT</pubDate></item><item><title><![CDATA[Reply to 变异分析 on Fri, 12 Aug 2022 12:33:05 GMT]]></title><description><![CDATA[<p dir="auto">CEPH 1463家系数据<br />
<a href="https://catalog.coriell.org/0/Sections/Collections/NIGMS/CEPHFamiliesDetail.aspx?PgId=441&amp;fam=1463&amp;" rel="nofollow ugc">https://catalog.coriell.org/0/Sections/Collections/NIGMS/CEPHFamiliesDetail.aspx?PgId=441&amp;fam=1463&amp;</a></p>
<p dir="auto"><a href="https://www.illumina.com/platinumgenomes.html" rel="nofollow ugc">https://www.illumina.com/platinumgenomes.html</a></p>
<p dir="auto"><a href="https://console.cloud.google.com/storage/browser/genomics-public-data/platinum-genomes/vcf?pageState=(%22StorageObjectListTable%22:(%22f%22:%22%255B%255D%22)" rel="nofollow ugc">https://console.cloud.google.com/storage/browser/genomics-public-data/platinum-genomes/vcf?pageState=("StorageObjectListTable":("f":"%255B%255D")</a>)&amp;prefix=&amp;forceOnObjectsSortingFiltering=false</p>
]]></description><link>http://an.forum.genostack.com/post/1796</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1796</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 12 Aug 2022 12:33:05 GMT</pubDate></item><item><title><![CDATA[Reply to 变异分析 on Wed, 17 Aug 2022 06:24:18 GMT]]></title><description><![CDATA[<p dir="auto">CNV</p>
<p dir="auto"><a href="https://gatk.broadinstitute.org/hc/en-us/articles/360035531452-After-gCNV-calling-considerations" rel="nofollow ugc">https://gatk.broadinstitute.org/hc/en-us/articles/360035531452-After-gCNV-calling-considerations</a><br />
<a href="https://gatk.broadinstitute.org/hc/en-us/articles/360035531152" rel="nofollow ugc">https://gatk.broadinstitute.org/hc/en-us/articles/360035531152</a><br />
<img src="/assets/uploads/files/1660276185338-38539c97-e87a-4514-ab3a-c5905d7b9596-image.png" alt="38539c97-e87a-4514-ab3a-c5905d7b9596-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto"><a href="https://www.biorxiv.org/content/10.1101/2021.04.30.442110v1.full" rel="nofollow ugc">https://www.biorxiv.org/content/10.1101/2021.04.30.442110v1.full</a><br />
A comparison of tools for copy-number variation detection in germline whole exome and whole genome sequencing data<br />
<img src="/assets/uploads/files/1660276891643-715615c1-4cdc-40c0-8d74-32ad701e2314-image.png" alt="715615c1-4cdc-40c0-8d74-32ad701e2314-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto">cnvpytor 验证记录<br />
小服务器  /ceph_disk3/var_demo_data</p>
<pre><code>sudo apt-get install python-tk
pip3 install  cnvpytor -i https://pypi.tuna.tsinghua.edu.cn/simple
用conda的话  需要把源码中的这些文件复制下
cp CNVpytor/cnvpytor/data/*.pytor /opt/miniconda3/lib/python3.7/site-packages/cnvpytor/data/
samtools index NA12877_S1.bam
cnvpytor -root 12877.pytor -rd NA12877_S1.bam
cnvpytor -root 12877.pytor -his 1000 10000 100000
cnvpytor -root 12877.pytor -partition 1000 10000 100000
cnvpytor -root 12877.pytor -call 1000 10000 100000

如下步骤为可选步骤 在使用snp时运行（CWL、WDL加开关控制）
/opt/miniconda3/bin/cnvpytor -root 12877.pytor -snp NA12877_S1.genome.vcf -sample NA12877
/opt/miniconda3/bin/cnvpytor -root 12877.pytor -pileup NA12877_S1.bam
/opt/miniconda3/bin/cnvpytor -root 12877.pytor -mask_snps
/opt/miniconda3/bin/cnvpytor -root 12877.pytor -baf 1000 10000 100000
/opt/miniconda3/bin/cnvpytor -root 12877.pytor -call baf 1000 10000 100000
注意下面这个命令需要一个文件来描述范围
/opt/miniconda3/bin/cnvpytor -root 12877.pytor -genotype 1000 10000 100000 &lt;regions
/opt/miniconda3/bin/cnvpytor -root 12877.pytor -call combined 1000 10000 100000
后续还有几个命令来输出图片  等报告一起做
</code></pre>
<p dir="auto">错误：</p>
<pre><code>Traceback (most recent call last):
  File "/home/anneng/.local/bin/cnvpytor", line 11, in &lt;module&gt;
    sys.exit(main())
  File "/home/anneng/.local/lib/python2.7/site-packages/cnvpytor/__main__.py", line 437, in main
    use_gc_corr=not args.no_gc_corr, use_mask=args.use_mask_with_rd)
  File "/home/anneng/.local/lib/python2.7/site-packages/cnvpytor/root.py", line 1502, in call
    distN = np.zeros_like(NN, dtype="long") - 1
  File "/home/anneng/.local/lib/python2.7/site-packages/numpy/core/numeric.py", line 168, in zeros_like
    res = empty_like(a, dtype=dtype, order=order, subok=subok)
TypeError: data type "long" not understood
需要使用python3 运行
</code></pre>
]]></description><link>http://an.forum.genostack.com/post/1795</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1795</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 17 Aug 2022 06:24:18 GMT</pubDate></item><item><title><![CDATA[Reply to 变异分析 on Thu, 11 Aug 2022 11:40:00 GMT]]></title><description><![CDATA[<p dir="auto">对于MNP(即MNVs) GATK当前的版本不支持 我们有2个选择 使用MAC进行纠正  或者使用另外的工具 freebayes官方宣传支持MNP<br />
<a href="https://github.com/freebayes/freebayes" rel="nofollow ugc">https://github.com/freebayes/freebayes</a><br />
MAC最近一直没有维护 建议直接用freebayes</p>
<p dir="auto">freebayes验证 对于多个样本 每个样本都要加上RG头</p>
<pre><code>bwa mem ecoli.fasta SRR10000374_1.fastq.gz SRR10000374_2.fastq.gz -R '@RG\tID:SRR10000374\tSM:SRR10000374' | samtools sort -o SRR10000374.bam -
bwa mem ecoli.fasta SRR10000377_1.fastq.gz SRR10000377_2.fastq.gz -R '@RG\tID:SRR10000377\tSM:SRR10000377' | samtools sort -o SRR10000377.bam -
../freebayes-1.3.6-linux-amd64-static -L list -f ecoli.fasta -v demo.vcf
</code></pre>
<p dir="auto"><a href="https://bioinformaticsworkbook.org/dataAnalysis/VariantCalling/freebayes-dnaseq-workflow.html#gsc.tab=0" rel="nofollow ugc">https://bioinformaticsworkbook.org/dataAnalysis/VariantCalling/freebayes-dnaseq-workflow.html#gsc.tab=0</a></p>
]]></description><link>http://an.forum.genostack.com/post/1793</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1793</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 11 Aug 2022 11:40:00 GMT</pubDate></item><item><title><![CDATA[Reply to 变异分析 on Thu, 11 Aug 2022 07:21:43 GMT]]></title><description><![CDATA[<p dir="auto"><a href="/assets/uploads/files/1660202476687-journal.pone.0262574.pdf">journal.pone.0262574.pdf</a><br />
<img src="/assets/uploads/files/1660202502031-1996ecf9-1022-4342-93cf-a120cb908c2e-image.png" alt="1996ecf9-1022-4342-93cf-a120cb908c2e-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/1792</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1792</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 11 Aug 2022 07:21:43 GMT</pubDate></item><item><title><![CDATA[Reply to 变异分析 on Thu, 11 Aug 2022 05:54:04 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ensembl.org/info/genome/variation/prediction/classification.html" rel="nofollow ugc">https://www.ensembl.org/info/genome/variation/prediction/classification.html</a><br />
<img src="/assets/uploads/files/1660197236146-d0526b6a-39c3-40f0-8b2a-a64e7bd7d46f-image.png" alt="d0526b6a-39c3-40f0-8b2a-a64e7bd7d46f-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/post/1791</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1791</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 11 Aug 2022 05:54:04 GMT</pubDate></item><item><title><![CDATA[Reply to 变异分析 on Thu, 11 Aug 2022 04:08:40 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://gatk.broadinstitute.org/hc/en-us/articles/360035530752-What-types-of-variants-can-GATK-tools-detect-or-handle-" rel="nofollow ugc">https://gatk.broadinstitute.org/hc/en-us/articles/360035530752-What-types-of-variants-can-GATK-tools-detect-or-handle-</a></p>
<p dir="auto">GATK and Picard variant manipulation tools are currently able to recognize the following types of alleles:</p>
<p dir="auto">SNP (single nucleotide polymorphism)<br />
INDEL (insertion/deletion)<br />
MIXED (combination of SNPs and indels at a single position)<br />
MNP (multi-nucleotide polymorphism, e.g. a dinucleotide substitution)<br />
SYMBOLIC (such as the &lt;NON-REF&gt; allele used in GVCFs produced by HaplotypeCaller, the * allele used to signify the presence of a spanning deletion, or undefined events like a very large allele or one that's fuzzy and not fully modeled; i.e. there's some event going on here but we don't know what exactly)</p>
]]></description><link>http://an.forum.genostack.com/post/1790</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1790</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 11 Aug 2022 04:08:40 GMT</pubDate></item><item><title><![CDATA[Reply to 变异分析 on Thu, 11 Aug 2022 04:08:03 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4521406/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4521406/</a><br />
MAC: identifying and correcting annotation for multi-nucleotide variations<br />
<a href="https://github.com/leiwei-bioinfo/MAC" rel="nofollow ugc">https://github.com/leiwei-bioinfo/MAC</a></p>
]]></description><link>http://an.forum.genostack.com/post/1789</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1789</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 11 Aug 2022 04:08:03 GMT</pubDate></item><item><title><![CDATA[Reply to 变异分析 on Thu, 11 Aug 2022 07:17:17 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6957021/" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6957021/</a><br />
Misannotation of multiple-nucleotide variants risks misdiagnosis<br />
<img src="/assets/uploads/files/1660189354008-934ae468-4041-402c-9a68-79dbbd5d1ec5-image.png" alt="934ae468-4041-402c-9a68-79dbbd5d1ec5-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto">To investigate whether using alternative tools results in correct annotation of MNVs, we re-processed the VCF file of simulated MNVs using GATK 3.6.0 ReadBackedPhasing 10 (default parameters plus “-maxDistMNP 2 -enableMergeToMNP”) or MAC 1.2 9 then annotated the resulting VCF files using Alamut batch version 1.5.2 (Interactive Biosoftware, Rouen, France). We also tested re-calling the variants using VarDict 1.4 7 and Platypus 0.8.1 12.</p>
<p dir="auto">GATK新版本已经没有了 ReadBackedPhasing 工具<br />
However, they do not emit MNPs. If you would like to combine contiguous SNPs into MNPs, you will need to use the legacy ReadBackedPhasing tool in GATK3 with the MNP merging function activated. See the GATK3 tool documentation for details.<br />
<a href="https://gatk.broadinstitute.org/hc/en-us/articles/360035530752-What-types-of-variants-can-GATK-tools-detect-or-handle-" rel="nofollow ugc">https://gatk.broadinstitute.org/hc/en-us/articles/360035530752-What-types-of-variants-can-GATK-tools-detect-or-handle-</a></p>
]]></description><link>http://an.forum.genostack.com/post/1788</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1788</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 11 Aug 2022 07:17:17 GMT</pubDate></item><item><title><![CDATA[Reply to 变异分析 on Thu, 11 Aug 2022 03:33:37 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253413/#:~:text=Multi%2Dnucleotide%20variants%20(MNVs),of%20the%20individual%20variants3" rel="nofollow ugc">https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253413/#:~:text=Multi-nucleotide variants (MNVs),of the individual variants3</a>.<br />
Landscape of multi-nucleotide variants in 125,748 human exomes and 15,708 genomes<br />
Multi-nucleotide variants (MNVs) are defined as clusters of two or more nearby variants existing on the same haplotype in an individual1,2 (Fig. 1a). When variants in an MNV are found within the same codon, the overall impact may differ from the functional consequences of the individual variants3.<br />
<img src="/assets/uploads/files/1660187719129-368dd461-ac62-429d-bf6d-4a658cb5a505-image.png" alt="368dd461-ac62-429d-bf6d-4a658cb5a505-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto">Identification of MNVs requires the constituent variants to be properly <strong>phased</strong>—that is, to be identified accurately as either both occurring on the same haplotype (in cis) or on two different haplotypes (in trans). Phasing can be performed following three broad strategies: read-based phasing18, which assesses whether nearby variants co-segregate on the same reads in DNA sequencing data; family-based phasing19, which assesses whether pairs of variants are co-inherited within families; and population-based phasing20, which leverages haplotype sharing between members of a large genotyped population to make a statistical inference of phase. Read-based phasing is particularly effective for pairs of nearby variants, making it suitable for the analysis of MNVs.</p>
]]></description><link>http://an.forum.genostack.com/post/1787</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1787</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 11 Aug 2022 03:33:37 GMT</pubDate></item></channel></rss>