sars-cov-2-variant-calling
-
https://www.ncbi.nlm.nih.gov/sra/docs/sars-cov-2-variant-calling/
To support ease of access via Athena, the VCF is first converted to SPDI format and then to Parquet format. The Parquet format supports direct queries in Athena, and users can identify runs containing specific SARS-CoV-2 variants.
这个文档提到了把vcf转换成Parquet