IT Consultancy • Continuous Delivery • Offshore Services • Deployment Automation • Digital Transformation
5001 - 10000
23 hours ago
AWS
Azure
Cloud
Distributed Systems
Docker
ETL
Google Cloud Platform
Informatica
Kubernetes
Numpy
Pandas
PySpark
Python
Spark
SQL
Go
IT Consultancy • Continuous Delivery • Offshore Services • Deployment Automation • Digital Transformation
5001 - 10000
• responsible for at-scale infrastructure design, build and deployment with a focus on distributed systems • building and maintaining architecture patterns for data processing, workflow definitions, and system to system integrations using Big Data and Cloud technologies • evaluating and translating technical design to workable technical solutions/code and technical specifications at par with industry standards • driving creation of re-usable artifacts • establishing scalable, efficient, automated processes for data analysis, data model development, validation, and implementation • working closely with analysts/data scientists to understand impact to the downstream data models • writing efficient and well-organized software to ship products in an iterative, continual release environment • contributing and promoting good software engineering practices across the team • communicating clearly and effectively to technical and non-technical audiences • defining data retention policies • monitoring performance and advising any necessary infrastructure changes
• 2+ years’ experience with Azure (Data Factory, Databricks) • 3+ years’ experience with data engineering or backend/fullstack software development • solid SQL and Git skills • Python scripting proficiency • experience with data transformation tools - Databricks and Spark • experience in structuring and modelling data in both relational and non-relational forms • ability to elaborate and propose relational/non-relational approach, normalization / denormalization and data warehousing concepts (star, snowflake schemas) • good verbal and written communication skills in English
Apply NowSeptember 25
201 - 500
Develop data pipelines and maintain a data ecosystem for customer solutions.