<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[大数据]]></title><description><![CDATA[大数据]]></description><link>http://an.forum.genostack.com/category/27</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 13:10:10 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/category/27.rss" rel="self" type="application/rss+xml"/><pubDate>Mon, 29 Jan 2024 10:46:34 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[lakefs]]></title><description><![CDATA[<p dir="auto">71794678-875a-4f70-85b7-af01aa4c8691-image.png</p>
]]></description><link>http://an.forum.genostack.com/topic/1038/lakefs</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1038/lakefs</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 29 Jan 2024 10:46:34 GMT</pubDate></item><item><title><![CDATA[Parabricks验证记录]]></title><description><![CDATA[<p dir="auto"><a href="https://developer.nvidia.com/blog/search-posts/?q=parabricks&amp;faceted_search_industry_str=Healthcare+%26+Life+Sciences" rel="nofollow ugc">https://developer.nvidia.com/blog/search-posts/?q=parabricks&amp;faceted_search_industry_str=Healthcare+%26+Life+Sciences</a><br />
一些应用案例</p>
]]></description><link>http://an.forum.genostack.com/topic/1029/parabricks验证记录</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1029/parabricks验证记录</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 04 Jan 2024 08:11:13 GMT</pubDate></item><item><title><![CDATA[Open-Source Data Viz with Superset and DuckDB]]></title><description><![CDATA[<p dir="auto"><a href="https://jorritsandbrink.substack.com/p/open-source-data-viz-with-superset" rel="nofollow ugc">https://jorritsandbrink.substack.com/p/open-source-data-viz-with-superset</a><br />
<img src="/assets/uploads/files/1702719601894-d2f2a32a-84fc-4a60-8aff-b7ef4977d41e-image.png" alt="d2f2a32a-84fc-4a60-8aff-b7ef4977d41e-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/1022/open-source-data-viz-with-superset-and-duckdb</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1022/open-source-data-viz-with-superset-and-duckdb</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Sat, 16 Dec 2023 09:40:11 GMT</pubDate></item><item><title><![CDATA[jupyterlab 远程访问]]></title><description><![CDATA[<p dir="auto"><a href="https://medium.com/mlearning-ai/set-up-remote-jupyter-lab-notebook-server-for-remote-browser-access-2cef464f203e" rel="nofollow ugc">https://medium.com/mlearning-ai/set-up-remote-jupyter-lab-notebook-server-for-remote-browser-access-2cef464f203e</a></p>
]]></description><link>http://an.forum.genostack.com/topic/1021/jupyterlab-远程访问</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1021/jupyterlab-远程访问</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Sat, 16 Dec 2023 03:29:52 GMT</pubDate></item><item><title><![CDATA[Iceberg + Spark + Trino + Dagster: modern, open-source data stack demo]]></title><description><![CDATA[<p dir="auto">f420072e-9274-4589-a969-01c940376714-image.png<br />
<a href="https://cube.dev/" rel="nofollow ugc">https://cube.dev/</a></p>
]]></description><link>http://an.forum.genostack.com/topic/1017/iceberg-spark-trino-dagster-modern-open-source-data-stack-demo</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1017/iceberg-spark-trino-dagster-modern-open-source-data-stack-demo</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 14 Dec 2023 07:26:58 GMT</pubDate></item><item><title><![CDATA[ssd和hdd混合]]></title><description><![CDATA[<p dir="auto"><a href="https://serverfault.com/questions/1144193/kubernetes-rook-ceph-different-pools-for-different-use-cases" rel="nofollow ugc">https://serverfault.com/questions/1144193/kubernetes-rook-ceph-different-pools-for-different-use-cases</a></p>
]]></description><link>http://an.forum.genostack.com/topic/1013/ssd和hdd混合</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1013/ssd和hdd混合</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 12 Dec 2023 06:13:25 GMT</pubDate></item><item><title><![CDATA[NCBI ENA的数据库设计]]></title><description><![CDATA[<p dir="auto"><a href="https://docs.ropensci.org/restez/" rel="nofollow ugc">https://docs.ropensci.org/restez/</a><br />
NOTE: Starting with v2.0.0, the database backend changed from MonetDBLite to <strong>duckdb</strong>. Because of this change, restez v2.0.0 or higher is not compatible with databases built with previous versions of restez.</p>
]]></description><link>http://an.forum.genostack.com/topic/1010/ncbi-ena的数据库设计</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1010/ncbi-ena的数据库设计</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 11 Dec 2023 06:50:35 GMT</pubDate></item><item><title><![CDATA[Pubmed]]></title><description><![CDATA[<p dir="auto">The updated version of PubMed takes advantage of several new technologies to improve the user experience. The underlying document data indexed in the updated version is a merger of content from PubMed, Bookshelf and PubMed Central (PMC). This combined dataset allows us to display relevant information not previously available in a PubMed record, such as reference citations from PMC. While legacy PubMed limited the number of variants for a wildcard (‘*’) search, PubMed is now capable of unlimited wildcard searches thanks to <strong>Solr</strong> (<a href="https://lucene.apache.org/solr/" rel="nofollow ugc">https://lucene.apache.org/solr/</a>), the open-source enterprise search system that PubMed now uses for document indexing. Users will find that PubMed now has greater scalability and reliability, provided not only by Solr, but also by the <strong>MongoDB</strong> storage solution and the modern cloud architecture that together ensure both redundancy between data centers and also trustworthy backup environments. When visiting PubMed, users will enjoy a modern web experience using the latest web technologies and standards, all provided by the <strong>Django</strong> web framework.</p>
<p dir="auto"><a href="https://academic.oup.com/nar/article/49/D1/D10/5937080" rel="nofollow ugc">https://academic.oup.com/nar/article/49/D1/D10/5937080</a></p>
]]></description><link>http://an.forum.genostack.com/topic/1009/pubmed</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/1009/pubmed</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 11 Dec 2023 03:55:23 GMT</pubDate></item><item><title><![CDATA[minikube使用指南]]></title><description><![CDATA[<p dir="auto"><a href="https://tech.olx.com/running-spark-on-kubernetes-a-fully-functional-example-and-why-it-makes-sense-for-olx-d56b6a61fcbe" rel="nofollow ugc">https://tech.olx.com/running-spark-on-kubernetes-a-fully-functional-example-and-why-it-makes-sense-for-olx-d56b6a61fcbe</a></p>
]]></description><link>http://an.forum.genostack.com/topic/993/minikube使用指南</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/993/minikube使用指南</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 05 Sep 2023 08:09:40 GMT</pubDate></item><item><title><![CDATA[知识图谱]]></title><description><![CDATA[<p dir="auto"><a href="https://www.nature.com/articles/s41587-023-01848-y" rel="nofollow ugc">https://www.nature.com/articles/s41587-023-01848-y</a></p>
]]></description><link>http://an.forum.genostack.com/topic/978/知识图谱</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/978/知识图谱</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Fri, 28 Jul 2023 07:32:19 GMT</pubDate></item><item><title><![CDATA[使用Jupytext将Rmd 转成jpuyter notebook]]></title><description><![CDATA[<p dir="auto"><a href="https://codes.correlaid.org/first%20steps/r/jupyter/2021/03/02/Convert-Rmd-files-to-Jupyter-Notebook.html" rel="nofollow ugc">https://codes.correlaid.org/first steps/r/jupyter/2021/03/02/Convert-Rmd-files-to-Jupyter-Notebook.html</a><br />
jupytext --to notebook script.Rmd</p>
]]></description><link>http://an.forum.genostack.com/topic/976/使用jupytext将rmd-转成jpuyter-notebook</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/976/使用jupytext将rmd-转成jpuyter-notebook</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 26 Jul 2023 10:10:25 GMT</pubDate></item><item><title><![CDATA[GenoStack对大数据分析的支持]]></title><description><![CDATA[<p dir="auto">运行spark-shell<br />
./bin/spark-shell   --master k8s://192.168.39.6:8443   --conf spark.kubernetes.container.image=spark   --conf spark.kubernetes.context=minikube   --conf spark.kubernetes.namespace=spark-demo   --verbose</p>
]]></description><link>http://an.forum.genostack.com/topic/972/genostack对大数据分析的支持</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/972/genostack对大数据分析的支持</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 20 Jul 2023 09:54:29 GMT</pubDate></item><item><title><![CDATA[数据变现]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1689153262885-7134af68-82a2-4deb-b1fa-a0194ae0f572-image.png" alt="7134af68-82a2-4deb-b1fa-a0194ae0f572-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/960/数据变现</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/960/数据变现</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 12 Jul 2023 09:14:26 GMT</pubDate></item><item><title><![CDATA[数据库的选择]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1688622158529-517349ac-457c-48eb-95a8-533580d7a8df-image.png" alt="517349ac-457c-48eb-95a8-533580d7a8df-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/948/数据库的选择</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/948/数据库的选择</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 06 Jul 2023 05:42:40 GMT</pubDate></item><item><title><![CDATA[Bioverse]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1688363893122-1687564622158.gif" alt="1687564622158.gif" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/937/bioverse</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/937/bioverse</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 03 Jul 2023 05:57:32 GMT</pubDate></item><item><title><![CDATA[solara]]></title><description><![CDATA[<p dir="auto"><a href="https://solara.dev/docs/tutorial/streamlit" rel="nofollow ugc">https://solara.dev/docs/tutorial/streamlit</a>  和streamlit的对比</p>
]]></description><link>http://an.forum.genostack.com/topic/926/solara</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/926/solara</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 21 Jun 2023 09:30:34 GMT</pubDate></item><item><title><![CDATA[Netflix架构]]></title><description><![CDATA[<p dir="auto">a64aff4c-5a53-4106-850a-e83731d759a0-image.png</p>
]]></description><link>http://an.forum.genostack.com/topic/922/netflix架构</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/922/netflix架构</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 14 Jun 2023 05:40:06 GMT</pubDate></item><item><title><![CDATA[如何选择数据库]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1686023382339-b170b5c8-e4ca-48a2-b5bc-1608b3b4476f-image.png" alt="b170b5c8-e4ca-48a2-b5bc-1608b3b4476f-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/916/如何选择数据库</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/916/如何选择数据库</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 06 Jun 2023 03:49:43 GMT</pubDate></item><item><title><![CDATA[不同的数据岗位]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1685591662090-5c2f5180-55e5-45de-80a9-9c89938dc4b6-image.png" alt="5c2f5180-55e5-45de-80a9-9c89938dc4b6-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/913/不同的数据岗位</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/913/不同的数据岗位</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 01 Jun 2023 03:54:23 GMT</pubDate></item><item><title><![CDATA[Data Architecture and Data Engineering]]></title><description><![CDATA[<p dir="auto">data pipeline.gif</p>
]]></description><link>http://an.forum.genostack.com/topic/912/data-architecture-and-data-engineering</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/912/data-architecture-and-data-engineering</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 01 Jun 2023 03:46:29 GMT</pubDate></item><item><title><![CDATA[cocalc]]></title><description><![CDATA[<p dir="auto"><a href="https://doc.cocalc.com/index.html" rel="nofollow ugc">https://doc.cocalc.com/index.html</a><br />
Hello, and welcome to CoCalc. CoCalc is a virtual online workspace for calculations, research, collaboration and authoring documents. Your web browser is all you need to escape the confined space of your desktop and move to the cloud. This guide explains the features of CoCalc in depth and shows how you can use them productively.</p>
]]></description><link>http://an.forum.genostack.com/topic/911/cocalc</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/911/cocalc</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 31 May 2023 02:09:02 GMT</pubDate></item><item><title><![CDATA[SQL执行顺序]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1685419054559-16d3e874-915f-4d84-9f62-d0a57ed012e6-image.png" alt="16d3e874-915f-4d84-9f62-d0a57ed012e6-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/910/sql执行顺序</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/910/sql执行顺序</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 30 May 2023 03:57:35 GMT</pubDate></item><item><title><![CDATA[apache data stack]]></title><description><![CDATA[<p dir="auto"><img src="/assets/uploads/files/1685330505750-37c50bfa-8e7f-4c30-8917-33052f20f1e4-image.png" alt="37c50bfa-8e7f-4c30-8917-33052f20f1e4-image.png" class=" img-responsive img-markdown" /></p>
]]></description><link>http://an.forum.genostack.com/topic/907/apache-data-stack</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/907/apache-data-stack</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 29 May 2023 03:24:53 GMT</pubDate></item><item><title><![CDATA[delta lake k8s spark]]></title><description><![CDATA[<p dir="auto">dd933512-b64f-434b-bfd1-468b5cf6bcb2-image.png</p>
]]></description><link>http://an.forum.genostack.com/topic/906/delta-lake-k8s-spark</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/906/delta-lake-k8s-spark</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Thu, 25 May 2023 16:54:09 GMT</pubDate></item><item><title><![CDATA[生物信息领域github topic]]></title><description><![CDATA[<p dir="auto">生物学：<br />
genomics<br />
biology<br />
Microbiology<br />
Agrigenomics<br />
bioinformatics<br />
cell<br />
gene<br />
gene expression<br />
gene regulation<br />
genetics<br />
Population Genomics<br />
genome<br />
DNA<br />
RNA<br />
Protein<br />
epigenetics<br />
transcripts</p>
<p dir="auto">化学：<br />
molecular</p>
<p dir="auto">临床研究类：<br />
cancer<br />
Complex Disease<br />
Clinical<br />
genetic disease<br />
Oncology<br />
Pharmacogenomics<br />
Drug Discovery<br />
Immunogenomics<br />
Neurogenomics<br />
pathogen</p>
<p dir="auto">技术：<br />
next-generation sequencing<br />
illumina<br />
pacbio<br />
nanopore<br />
python<br />
R<br />
spark<br />
statistics<br />
Linux<br />
bash<br />
julia<br />
javasctipt</p>
]]></description><link>http://an.forum.genostack.com/topic/902/生物信息领域github-topic</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/902/生物信息领域github-topic</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Wed, 24 May 2023 03:52:40 GMT</pubDate></item><item><title><![CDATA[spark在生信领域的应用]]></title><description><![CDATA[<p dir="auto"><a href="https://www.sciencedirect.com/science/article/pii/S2405844023005753" rel="nofollow ugc">https://www.sciencedirect.com/science/article/pii/S2405844023005753</a></p>
]]></description><link>http://an.forum.genostack.com/topic/900/spark在生信领域的应用</link><guid isPermaLink="true">http://an.forum.genostack.com/topic/900/spark在生信领域的应用</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Mon, 22 May 2023 10:51:53 GMT</pubDate></item></channel></rss>