Yesterday
Airflow
Ansible
AWS
Chef
Cloud
Google Cloud Platform
Java
JavaScript
Jenkins
Kubernetes
Oracle
Python
Terraform
Go
• Lead initiatives to enhance and optimize existing cloud infrastructure driving improvements in scalability and resilience. • Refine and deploy cloud infrastructure solutions that align with the product and technical roadmap. • Oversee and manage large-scale projects related to cloud platforms, automation, and performance optimization. • Continuously assess and refine cloud architecture, focusing on security, scalability, and performance across cloud infrastructure. • Utilize advanced programming skills to develop and optimize tools for infrastructure management and automation. • Write, review, and maintain code in scripting languages (Python, JavaScript) and system programming languages (GoLang). • Collaborate with engineering teams to integrate SRE principles into the product lifecycle. • Develop and implement automation strategies to enhance system deployment, monitoring, and operational efficiency. • Design and manage CI/CD pipelines to improve the speed, reliability, and consistency of software delivery. • Maintain and support production systems and associated infrastructure. • Work closely with cross-functional teams to understand product and technical roadmaps. • Mentor junior SREs, sharing best practices in cloud architecture, automation, and incident management. • Lead the effort to continuously improve the availability, scalability, and efficiency of systems across the cloud.
• B.E./B.Tech in Computer Science or a related field, or equivalent experience. • A minimum of 6+ years of industry experience in site reliability engineering, system engineer, or a related role, ideally in large-scale environments, with a focus on supporting 24x7 highly-available systems. • Advanced scripting skills in languages such as Python, Golang, Java, with the ability to write fully functional scripts/programs for automation and tool development. • Hands-on experience with cloud platforms (AWS or GCP), with a strong understanding of cloud architecture best practices, deployment strategies, and scaling within these ecosystems. • Deep knowledge of containerization and orchestration, particularly with Kubernetes, and practical experience managing large-scale containerized environments. • Expertise in running infrastructure automation tools at scale, such as Git, Airflow, Jenkins, Screwdriver, for managing code deployments and continuous integration workflows. • Proficient in Infrastructure as Code (IaC) tools, such as Ansible, Chef, Terraform, with experience automating infrastructure at scale. • Understanding of DevOps methodologies and practices, promoting collaboration between development and operations teams for improved service delivery. • Experience with monitoring and logging solutions, enabling proactive identification and resolution of issues. • Familiarity with incident and change management frameworks, such as ITIL or other industry standards. • Proven experience in mentoring and developing junior SREs. • Demonstrated ability to provide technical leadership in incident, change, and problem management activities, especially in high-pressure, production environments.
Apply NowYesterday
51 - 200
Join TechBiz Global as a DevOps Support Engineer focusing on Azure solutions, providing technical support.
🇮🇳 India – Remote
⏰ Full Time
🟢 Junior
🟡 Mid-level
⛑ DevOps & Site Reliability Engineer (SRE)
🚫👨🎓 No degree required
3 days ago
51 - 200
Join Imagine.io as a Senior DevOps Engineer to lead MLOps and DevOps solutions in 3D visualization.
6 days ago
5001 - 10000
Join Teladoc Health as a DevOps Engineer to enhance product performance through automation.
November 12
1001 - 5000
DevOps Engineer streamlines builds for Sophos' cybersecurity platform.
🇮🇳 India – Remote
💰 Post-IPO Equity on 2021-08
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
November 10
11 - 50
DevOps Engineer for cloud security and monitoring solutions at Anteelo.
🇮🇳 India – Remote
💵 ₹1.2M - ₹1.4M / year
⏰ Full Time
🟢 Junior
🟡 Mid-level
⛑ DevOps & Site Reliability Engineer (SRE)
🚫👨🎓 No degree required