Lead Data Engineer

October 5

Apply Now
Logo of StrongDM

StrongDM

51 - 200

💰 $54M Series B on 2021-09

Description

•StrongDM is driven by a clear mission: Secure Access, Zero Trust. •We design products and solutions that reflect this commitment, transforming the way organizations manage privileged access across their critical infrastructure. •By leading with Zero Trust Privileged Access Management (PAM), we help our customers achieve secure, dynamic, and fine-grained control over access to their most sensitive resources. •This focus on security has earned us an industry-leading 98% customer retention rate. •Once a customer, forever a fan. That's our goal. •When you work at StrongDM, you join a team committed to solving today’s security challenges with technology that works and customers who trust us to protect their most critical assets. •If you ask anyone at StrongDM, you’ll find that our values truly guide everything we do—from how we innovate to how we treat each other. •These values are the foundation of our culture and define who we are as a company. •If this sounds like an environment where you’d thrive, read on. •We are seeking a highly skilled Principal Data Engineer with extensive experience in building cloud data lakes and architecting large-scale data platforms. •You will be instrumental in designing and implementing data architectures that support diverse use cases, AI/ML to business intelligence (BI). •The ideal candidate will have deep expertise in tabular formats like Apache Iceberg, Apache Parquet, and other open standards. •As a Lead Data Engineer, you will work closely with data scientists, AI teams, and business stakeholders to ensure that our data infrastructure is robust, scalable, and optimized for a variety of computational workloads. •This role requires an innovative mindset and the ability to lead data engineering projects, making key architectural decisions that shape our data ecosystem.

Requirements

•Big Data Technologies: Strong knowledge of big data processing frameworks and data streaming technologies. •AI/ML Data Integration: Experience collaborating with AI/ML teams, building data pipelines that feed AI models, and ensuring data readiness for machine learning workflows. •Experience in Cloud Data Lakes: Proven experience in architecting and building data lakes on cloud platforms (AWS, Azure, GCP). •Open Standards Expertise: In-depth knowledge of Apache Iceberg, Apache Parquet, and other open standards for efficient data storage and query optimization. •Compute Engines: Expertise in using compute engines such as Apache Spark, Dremio, Presto, or similar, with hands-on experience in optimizing them for business intelligence and AI workloads. •Leadership: Proven track record of leading large-scale data engineering projects and mentoring teams. •Programming Languages: Proficiency in languages such as Python, Java, or Scala, and SQL for querying and managing large datasets. •AI/ML Workflows: Previous experience working directly with AI or machine learning teams preferred •Distributed Systems: A deep understanding of distributed systems and the challenges of scaling data infrastructure in large, dynamic environments preferred •Data Warehouse Experience: Familiarity with modern data warehousing solutions such as Snowflake or Redshift preferred

Benefits

•$190,000-$230,000 DOE + equity salary packages •Company-sponsored benefits, including: •Medical, dental, and vision insurance (free to employees and dependents) •401K, HSA, FSA, short/long-term disability coverage, life insurance •6 weeks of combined accrued vacation + sick time •Volunteer days + standard holidays •24 weeks paid parental leave for everyone + 1 month transition time back + childcare stipend for first year •Generous monthly and annual stipend for internet + home office

Apply Now

Similar Jobs

October 4

Design and maintain data infrastructure for CDC Foundation's public health initiatives.

October 4

Data Engineer to build and maintain data infrastructure for CDC Foundation's health programs.

October 4

Beyond Finance

1001 - 5000

Supports data operations for Beyond Finance's data ecosystem as a Data Operations Engineer.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com