<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[epam招聘中提到spark pyspark]]></title><description><![CDATA[<p dir="auto">About the job</p>
<p dir="auto">Description</p>
<p dir="auto">We are looking for a Bioinformatics Engineer to join our team in Serbia</p>
<p dir="auto">Requirements</p>
<p dir="auto">Familiarity with Molecular Biology data and techniques associated with:<br />
NGS data, processing and analysis<br />
GWAS<br />
Genome assembly<br />
Genome annotation<br />
SNPs and Haplotypes</p>
<p dir="auto">Strong proficiency in SQL (PostgreSQL, MySQL, Oracle)<br />
Solid experience with AWS (S3, EC2, Fargate, ECS, Lambda). AWS certification is a plus<br />
Big Data and large-scale data processing:<br />
Apache Spark (PySpark, Sparklyr)</p>
<p dir="auto">Solid working experience with Python:<br />
Solid experience with OOP<br />
Poetry, Pydantic<br />
Good knowledge of ORM (sqlAlchemia)<br />
Knowledge of any of REST-frameworks (Flask, FastAPI, Django) is a plus<br />
Unit-testing, TDD<br />
Packaging Python packages and publishing to PIP/Conda<br />
AsyncIO, multiprocessing / multithreading</p>
<p dir="auto">Solid experience with Git, knowledge of different branching strategies (Gitlfow, GitHub flow, Trunk Based Development)<br />
Deep understanding of CI/CD process, hands-on experience with Jenkins<br />
Hands-on experience with Docker, Docker compose<br />
Confident Linux user, bash scripting</p>
<p dir="auto">Nice to have</p>
<p dir="auto">RStudio ecosystem:<br />
RStudio Pro<br />
RStudio Package Manager<br />
RStudio connect<br />
CRAN</p>
<p dir="auto">Python ecosystem:<br />
PIP / Conda / Mamba package management<br />
JupyterLab</p>
<p dir="auto">Linux / HPC:<br />
Confident operating high-performance cluster systems via schedulers such as SLURM or Univa Grid Engine (UGE)/Sun Grid Engine (SGE)</p>
<p dir="auto">Deep learning frameworks:<br />
PyTorch<br />
TensorFlow</p>
<p dir="auto">We offer</p>
<p dir="auto">Dynamic, entrepreneurial, high speed, high growth corporate environment<br />
Diverse multicultural, multi-functional, and multilingual work environment<br />
Opportunities for personal and career growth in a progressive industry<br />
Global scope, international projects<br />
Widespread training and development opportunities<br />
Unlimited access to LinkedIn learning solutions<br />
Competitive salary and various benefits<br />
Sport and social teams support, recreation area, advanced CSR programs</p>
]]></description><link>http://an.forum.genostack.com/topic/773/epam招聘中提到spark-pyspark</link><generator>RSS for Node</generator><lastBuildDate>Sat, 13 Jun 2026 14:28:34 GMT</lastBuildDate><atom:link href="http://an.forum.genostack.com/topic/773.rss" rel="self" type="application/rss+xml"/><pubDate>Tue, 01 Nov 2022 02:16:59 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to epam招聘中提到spark pyspark on Tue, 01 Nov 2022 02:20:02 GMT]]></title><description><![CDATA[<p dir="auto"><a href="https://www.nature.com/naturecareers/job/fullstack-developer-european-molecular-biology-laboratory-embl-765829" rel="nofollow ugc">https://www.nature.com/naturecareers/job/fullstack-developer-european-molecular-biology-laboratory-embl-765829</a><br />
EMBL招聘也需要spark</p>
]]></description><link>http://an.forum.genostack.com/post/1872</link><guid isPermaLink="true">http://an.forum.genostack.com/post/1872</guid><dc:creator><![CDATA[anneng]]></dc:creator><pubDate>Tue, 01 Nov 2022 02:20:02 GMT</pubDate></item></channel></rss>