Site Reliability Engineer

November 27

Apply Now

Description

• Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform. • Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and scalability of infrastructure components. • Develop and maintain monitoring solutions to proactively identify performance bottlenecks, system outages, and other potential issues. • Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents. • Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance tuning, and capacity planning. • Identify opportunities to automate manual processes and improve system resilience. • Utilise Python or Bash scripting to create and maintain automation tools for various operational tasks and deployments. • Collaborate with security teams to implement best practices for securing cloud infrastructure and services. • Ensure compliance with relevant industry standards and regulations. • Support CI/CD pipelines for application deployments and updates. • Maintain clear and up-to-date documentation for infrastructure configurations, processes, and incident resolution procedures.

Requirements

• Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience). • Proven experience as a Site Reliability Engineer or similar role. • Extensive experience with Amazon Web Services (AWS) and its core services (EC2, S3, RDS, IAM, etc.). • Strong proficiency in infrastructure-as-code (IaC) tools, with a focus on Terraform. • Proficient in scripting with Python or Bash for automation and operational tasks. • Solid understanding of networking principles and protocols. • Knowledge of CI/CD pipelines and related tools.

Apply Now

Similar Jobs

November 22

As DevOps Manager at OnBuy, manage operations and cloud infrastructure for a growing marketplace.

🇬🇧 United Kingdom – Remote

💵 £80k - £90k / year

💰 Debt Financing on 2021-07

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 9

Drive development and operations of Kahootz's secure cloud collaboration platform.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com