Principal Data Engineer

September 28

Apply Now
Logo of Seamless.AI

Seamless.AI

Artificial Intelligence β€’ Machine Learning β€’ Natural Language Processing β€’ Neuro-Linguistic Programming β€’ Data Science

201 - 500

Description

β€’ Design, develop, and maintain robust and scalable ETL pipelines to acquire, transform, and load data from various sources into our data ecosystem. β€’ Collaborate with cross-functional teams to understand data requirements and develop efficient data acquisition and integration strategies. β€’ Implement data transformation logic using Python and other relevant programming languages and frameworks. β€’ Utilize AWS Glue or similar tools to create and manage ETL jobs, workflows, and data catalogs. β€’ Optimize and tune ETL processes for improved performance and scalability, particularly with large data sets. β€’ Apply methodologies and techniques for data matching, deduplication, and aggregation to ensure data accuracy and quality. β€’ Implement and maintain data governance practices to ensure compliance, data security, and privacy. β€’ Collaborate with the data engineering team to explore and adopt new technologies and tools that enhance the efficiency and effectiveness of data processing.

Requirements

β€’ 7+ years of experience as a Data Engineer, with a focus on ETL processes and data integration. β€’ Professional experience with Spark and AWS pipeline development required. β€’ Bachelor's degree in Computer Science, Information Systems, related fields or equivalent years of work experience.

Apply Now

Similar Jobs

September 27

Transform Medicaid Drug Program data for public benefit at Blue Tiger.

Built byΒ Lior Neu-ner. I'd love to hear your feedback β€” Get in touch via DM or lior@remoterocketship.com