Staff Software Engineer, Data Infrastructure

July 31

Helix

Helix is the leading population genomics and viral surveillance company.

Consumer Genetics • Next Generation Sequencing (NGS) • Bioinformatics • Consumer Insights • Big Data

51 - 200

Description

• The Helix Data Engineering team plays a pivotal role in Helix’s efforts to provide a first-in-class clinicogenomics research dataset that serves our internal research team, provides operational insights back to health systems, and is a valuable asset in our growing Life Science business.
• Working closely with Research, Bioinformatics, and other Engineering teams, we are responsible for maintaining infrastructure that enables secure analysis of this quickly growing dataset.
• The patient is top of mind in everything we do, and your contributions here have the opportunity to improve real-world outcomes for everyone.
• Maintain and evolve infrastructure that allows scientists to process and analyze Helix-produced clinicogenomics datasets.
• Drive data infrastructure and data management strategy for clinical and genomic data, contributing to platform components and pipelines that increase the value and usability of these key assets.
• Collaborate cross-functionally with product managers, bioinformaticians, scientists, other engineers, and business leaders.
• Establish and maintain strong engineering best practices.
• Own systems and services from development to production.
• Mentor other team members to reinforce a culture of learning and teaching.

Requirements

• Reside in the US, Canada, Mexico, Chile, or Colombia
• Bachelor's/Master's degree in Computer Science, Bioinformatics, Engineering, Mathematics, or a related field with 7+ years of experience; or PhD with 2+ years of experience
• Proven experience in data engineering
• Proficiency in Python, Go, Java, Scala, or similar
• Proficiency with distributed systems built on cloud infrastructure (AWS or similar)
• Experience with infrastructure-as-code tooling/frameworks (e.g., Terraform, CloudFormation, AWS CDK)
• Experience with authentication protocols such as OAuth, OIDC, SAML, and JWT
• Proficiency in managing Identity and Access Management (IAM) configurations
• Expertise with distributed compute frameworks such as Spark, Dask, EMR, Databricks, or similar
• Expertise with ETL pipeline automation and workflow management tools such as Airflow, AWS Glue, AWS Step Functions, and CI/CD
• Familiarity with database design, data manipulation, and data quality techniques
• Adaptable in a fast-paced startup environment where priorities may change quickly and frequently
• Demonstrated willingness to learn new domains (e.g., genomics, healthcare) and associated technologies

Benefits

• Comprehensive Health Insurance with Date of Hire eligibility
• Above average employer paid premium coverage
• 12 weeks Helix Paid Parental Leave option
• 401(k) with employer matching of up to 3% and 100% Vesting on the Date of Hire
• Comprehensive Well-Being Benefits
• 18 well-being programs covering financial, legal and wellness solutions
• Flexible PTO
• Remote options for many roles and a home office stipend
