
Senior Big Data Engineer - Airflow and Oozie (GCP)

May 1


Rackspace Technology

Realize the full value of the cloud.

IT as a Service • Multi-Cloud • Managed Hosting • Managed AWS/Azure/Google Cloud Platform/OpenStack/Alibaba • Managed Private Cloud for VMware/Microsoft/OpenStack

5,001 - 10,000 employees

Description

• Responsible for developing batch processing systems using technologies such as Hadoop, Oozie, Pig, Hive, MapReduce, Spark (Java), Python, and HBase
• Manage and optimize data workflows using Oozie and Airflow within the Apache Hadoop ecosystem (see the sketch below)
• Utilize GCP for scalable big data processing and storage solutions
• Implement automation/DevOps best practices for CI/CD, IaC, and related tooling
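
For context, below is a minimal sketch of the kind of Airflow DAG this role would build and manage: a daily two-step batch pipeline. The DAG id, task names, owner, and commands are illustrative placeholders, not Rackspace pipelines.

# Minimal illustrative Airflow DAG: a daily two-step batch pipeline.
# All names (dag_id, task_ids, owner, commands) are hypothetical placeholders.
from datetime import datetime, timedelta

from airflow import DAG
from airflow.operators.bash import BashOperator

default_args = {
    "owner": "data-engineering",            # hypothetical team name
    "retries": 2,                           # retry failed tasks twice
    "retry_delay": timedelta(minutes=10),
}

with DAG(
    dag_id="daily_batch_ingest",            # hypothetical DAG name
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",             # run once per day
    catchup=False,
    default_args=default_args,
) as dag:
    # Stage raw files into the landing zone (placeholder command).
    ingest = BashOperator(
        task_id="ingest_raw_data",
        bash_command="echo 'copy raw files to the GCS/HDFS landing zone'",
    )

    # Run the Spark (Java) batch job via spark-submit (placeholder command).
    transform = BashOperator(
        task_id="run_spark_batch",
        bash_command="echo 'spark-submit --class com.example.BatchJob job.jar'",
    )

    ingest >> transform                     # transform runs only after ingest succeeds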

Requirements

• Experience with GCP managed services and an understanding of cloud-based batch processing systems are critical (see the sketch after this list)
• Proficiency in Oozie, Airflow, MapReduce, and Java
• Strong programming skills with Java (specifically Spark), Python, Pig, and SQL
• Expertise in public cloud services, particularly GCP
• Proficiency in the Apache Hadoop ecosystem, including Oozie, Pig, Hive, and MapReduce
• Familiarity with Bigtable and Redis
• Experience applying infrastructure and DevOps principles in daily work, using tools for continuous integration and continuous deployment (CI/CD) and Infrastructure as Code (IaC), such as Terraform, to automate and improve development and release processes
• Ability to tackle complex challenges and devise effective solutions, using critical thinking to approach problems from various angles and propose innovative solutions
• Proven ability to work effectively in a remote setting, with strong written and verbal communication skills, collaborating with team members and stakeholders to ensure a clear understanding of technical requirements and project goals
• Proven experience engineering batch processing systems at scale
• Hands-on experience with public cloud platforms, particularly GCP; additional experience with other cloud technologies is advantageous
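
As a rough illustration of the GCP managed-services requirement (not taken from the posting), the sketch below submits a packaged Spark (Java) batch job to a Dataproc cluster using the google-cloud-dataproc Python client. The project, region, cluster, bucket, and main class names are hypothetical.

# Illustrative only: submit an existing Spark (Java) jar as a batch job to a
# GCP Dataproc cluster. Project, region, cluster, and GCS paths are hypothetical.
from google.cloud import dataproc_v1

REGION = "us-central1"                                         # hypothetical region
client = dataproc_v1.JobControllerClient(
    client_options={"api_endpoint": f"{REGION}-dataproc.googleapis.com:443"}
)

job = {
    "placement": {"cluster_name": "example-batch-cluster"},    # hypothetical cluster
    "spark_job": {
        "main_class": "com.example.BatchJob",                  # hypothetical entry point
        "jar_file_uris": ["gs://example-bucket/jars/batch-job.jar"],
    },
}

# Submit the job and block until it completes.
operation = client.submit_job_as_operation(
    request={"project_id": "example-project", "region": REGION, "job": job}
)
result = operation.result()
print(f"Job finished with state: {result.status.state.name}")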

Responsibilities

• Develop scalable and robust code for batch processing systems, working with technologies such as Hadoop, Oozie, Pig, Hive, MapReduce, Spark (Java), Python, and HBase
• Develop, manage, and optimize data workflows using Oozie and Airflow within the Apache Hadoop ecosystem
• Leverage GCP for scalable big data processing and storage solutions
• Implement automation/DevOps best practices for CI/CD, IaC, and related tooling

Apply Now