<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[常见的生物信息格式转换成统一的parquet文件]]></title><description><![CDATA[<p dir="auto"><a href="https://github.com/BlueGranite/azure-synapse-vcf-analysis/blob/main/ConvertVCFsToParquet.md" rel="nofollow ugc">https://github.com/BlueGranite/azure-synapse-vcf-analysis/blob/main/ConvertVCFsToParquet.md</a><br />
<a href="https://techcommunity.microsoft.com/t5/healthcare-and-life-sciences/genomic-data-in-parquet-format-on-azure/ba-p/3150554" rel="nofollow ugc">https://techcommunity.microsoft.com/t5/healthcare-and-life-sciences/genomic-data-in-parquet-format-on-azure/ba-p/3150554</a><br />
<a href="https://techcommunity.microsoft.com/t5/healthcare-and-life-sciences/convert-synthetic-fhir-and-pacbio-vcf-data-to-parquet-and/ba-p/3577038" rel="nofollow ugc">https://techcommunity.microsoft.com/t5/healthcare-and-life-sciences/convert-synthetic-fhir-and-pacbio-vcf-data-to-parquet-and/ba-p/3577038</a><br />
微软的Azure使用的parquet格式</p>
<p dir="auto">主要使用的是Glow<br />
<a href="https://medium.com/23andme-engineering/genetic-datastore-4b213256db31" rel="nofollow ugc">https://medium.com/23andme-engineering/genetic-datastore-4b213256db31</a></p>
<p dir="auto"><a href="https://github.com/natir/vcf2parquet" rel="nofollow ugc">https://github.com/natir/vcf2parquet</a><br />
一个RUST项目 感觉很多小工具使用的是RUST 可能性能比较高</p>
<p dir="auto"><a href="https://github.com/BigDataWUR/tomatula" rel="nofollow ugc">https://github.com/BigDataWUR/tomatula</a></p>
<p dir="auto"><a href="https://documentation.dnanexus.com/user/spark/example-applications/vcf-loader" rel="nofollow ugc">https://documentation.dnanexus.com/user/spark/example-applications/vcf-loader</a><br />
<a href="https://adam.readthedocs.io/en/latest/api/genomicDataset/" rel="nofollow ugc">https://adam.readthedocs.io/en/latest/api/genomicDataset/</a></p>
<p dir="auto"><a href="https://www.biostars.org/p/9566003/" rel="nofollow ugc">https://www.biostars.org/p/9566003/</a></p>
<p dir="auto"><a href="https://github.com/natir/variantplaner" rel="nofollow ugc">https://github.com/natir/variantplaner</a></p>
]]></description><link>http://an.forum.genostack.com/topic/1044/常见的生物信息格式转换成统一的parquet文件</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 12:34:08 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/1044.rss" rel="self" type="application/rss+xml"/><pubDate>Sun, 04 Feb 2024 05:05:49 GMT</pubDate><ttl>60</ttl></channel></rss>