暗能星系

    • 登录
    • 搜索

    突变分析

    生物信息分析
    1
    7
    4
    正在加载更多帖子
    • 从旧到新
    • 从新到旧
    • 最多赞同
    回复
    • 在新帖中回复
    登录后回复
    此主题已被删除。只有拥有主题管理权限的用户可以查看。
    • A
      anneng 最后由 编辑

      https://sci-hub.st/10.1038/ng.806
      839adc13-6fc0-4841-91af-5bd5699da976-image.png

      1 条回复 最后回复 回复 引用 0
      • A
        anneng 最后由 编辑

        https://www.intechopen.com/chapters/67673
        7aabc5dd-5360-4cc1-81a2-4d2ab362947a-image.png

        1 条回复 最后回复 回复 引用 0
        • A
          anneng 最后由 anneng 编辑

          https://www.sciencedirect.com/science/article/pii/S1879625721000651#bib0070
          Quantitative measures of within-host viral genetic diversity
          0879421d-a41f-43e6-b41b-987716bca08e-image.png
          这个文章对病毒的遗传多样性指标做了详细的介绍 而且有例子 重点看看

          香农熵有两个场景:针对碱基位置的和针对单倍型的
          9a30ddd8-1662-4e53-93c7-37fadd28c9a9-image.png
          ce61a98e-4422-4cd9-b8de-db757cb60cdf-image.png

          1 条回复 最后回复 回复 引用 0
          • A
            anneng 最后由 编辑

            An integrated software for virus community sequencing data analysis
            770227c1-eb9b-4b24-b1bf-f70f74528970-image.png
            一个集成的病毒群体研究软件 也计算了很多指标

            1 条回复 最后回复 回复 引用 0
            • A
              anneng 最后由 编辑

              https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4944299/
              Sequence analysis. Reads were aligned to either a PR8 or a WSN33 reference sequence using Bowtie2 (44). The alignments were sorted, and PCR duplicates were removed using Picard (http://broadinstitute.github.io/picard/). Variants were called using either DeepSNV (26) or LoFreq (28) and filtered using the Pysam module in Python and custom R scripts available for download at https://github.com/lauringlab/Benchmarking_paper. Bases with a Phred score of <30 were masked in the DeepSNV analysis. We connected all of these steps into an analytical pipeline using bpipe (45), which is available for download at https://github.com/lauringlab/variant_pipeline. To save memory during SNV processing, only variants with P values of <0.9 were included in our receiver operating characteristic (ROC) curve analysis, as the vast majority of true negatives are trivial to identify and have a P value of 1. For ease of viewing, and to account for this analytical artifact, we extended the ROC curves horizontally from the last observed change in sensitivity. All the commands required to generate the figures are available for anonymous download at https://github.com/lauringlab/Benchmarking_paper. An interactive Shiny app of our benchmarking work can be downloaded at https://github.com/lauringlab/Benchmarking_shiny.
              Diversity metrics. The Shannon entropy (H) of each genomic position was calculated as follows: H = −Σi = 1nxiln(xi), where xi represents the frequency of the ith allele and n represents the number of alleles found at the given position. Since our data do not represent haplotypes, we report Shannon's entropy as the mean across all genomic positions.
              The L1 norm (L) between 2 populations was calculated as follows: L = Σi = 1n|pi − qi|, where n represents the union of variants between the two samples and pi and qi represent the frequencies of the ith variant in each sample.

              Data set accession number. All raw fastq files have been submitted to the Sequence Read Archive (SRA) under BioProject accession number PRJNA317621.

              1 条回复 最后回复 回复 引用 0
              • A
                anneng 最后由 编辑

                https://academic.oup.com/ve/article/5/1/vey041/5304643?login=false

                1 条回复 最后回复 回复 引用 0
                • A
                  anneng 最后由 编辑

                  https://merenlab.org/2012/05/11/oligotyping-pipeline-explained/
                  b21e9632-ac1a-4c95-9f77-9aa9af85dc4b-image.png

                  1 条回复 最后回复 回复 引用 0
                  • First post
                    Last post
                  Powered by 暗能星系