<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Genome-wide Association Studies (GWAS)]]></title><description><![CDATA[<p dir="auto"><a href="https://www.sciencedirect.com/science/article/pii/S1018364722000283" rel="nofollow ugc">https://www.sciencedirect.com/science/article/pii/S1018364722000283</a><br />
Genome-wide Association Studies (GWAS) are conducted to identify single nucleotide polymorphisms (variants) associated with a phenotype within a specific population.</p>
<p dir="auto">The primary purpose of machine learning algorithms is to define a function<br />
which predicts an unknown phenotype based on a genotype observation  based on sample data</p>
<p dir="auto">A major reason for the adoption of machine learning algorithms is that they are well suited for developing predictive models when the number of features is larger than the number of samples.</p>
<p dir="auto">A standard characteristic of GWAS results is that the number of attributes (p) greatly outnumber the number of sample points (n). This is usually described as the curse of dimensionality or the large p and small n problem.Ideally, it is a problem for classical multivariate regression.</p>
<p dir="auto"><img src="/assets/uploads/files/1682663241320-901b8e1d-049e-4627-bd4d-ec8c37694015-image.png" alt="901b8e1d-049e-4627-bd4d-ec8c37694015-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/863/genome-wide-association-studies-gwas</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 12:30:52 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/863.rss" rel="self" type="application/rss+xml"/><pubDate>Fri, 28 Apr 2023 06:43:49 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to Genome-wide Association Studies (GWAS) on Fri, 28 Apr 2023 07:45:49 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.nature.com/articles/s41598-019-46649-z" rel="nofollow ugc">https://www.nature.com/articles/s41598-019-46649-z</a><br />
Comparative performances of machine learning methods for classifying Crohn Disease patients using genome-wide genotyping data</p>
]]></description><link>http://an.forum.genostack.com/post/2090</link><guid isPermaLink="true">http://an.forum.genostack.com/post/2090</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 28 Apr 2023 07:45:49 GMT</pubDate></item><item><title><![CDATA[Reply to Genome-wide Association Studies (GWAS) on Fri, 28 Apr 2023 06:53:28 GMT]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1682664235371-d1b9d6dc-25d0-4fb4-a5d5-89cb9bd0a2eb-image.png" alt="d1b9d6dc-25d0-4fb4-a5d5-89cb9bd0a2eb-image.png" class=" img-responsive img-markdown" /><br />
上面这篇文章 使用关键词 (Machine learning) AND (genome-wide association studies) 搜索后 还进行了人工过滤  过滤的条件包括：<br />
1.不能是review articles类型的文章  <a href="https://www.sciencedirect.com/" rel="nofollow ugc">https://www.sciencedirect.com/</a> 的文章本身就是分类的 这个可能在发表的时候就打上了标签<br />
<img src="/assets/uploads/files/1682664416728-fc65b8c2-04f5-4209-8d48-42f5a720077a-image.png" alt="fc65b8c2-04f5-4209-8d48-42f5a720077a-image.png" class=" img-responsive img-markdown" /></p>
<p dir="auto">2.必须研究的是“人”  这个就涉及到语义搜索或者实体识别 能推断某个文章的研究对象包括人 把这个需求推广一下  科学家可能会搜索 某篇文章研究了某个基因？ 某篇文章研究了某个蛋白？某篇文章使用了ML方法？某个研究领域的文献？</p>
]]></description><link>http://an.forum.genostack.com/post/2089</link><guid isPermaLink="true">http://an.forum.genostack.com/post/2089</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 28 Apr 2023 06:53:28 GMT</pubDate></item></channel></rss>