Senior Site Reliability Engineer

October 11

🇨🇷 Costa Rica – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)


Acquia

Open Source Social Business Software • Technical Support & Expert Consulting for Drupal • Fully-Managed Drupal Cloud Hosting for Enterprises • Drupal Software-as-a-Service • Digital Experience Platform

1001 - 5000

💰 Secondary Market on 2018-08

Description

• Acquia empowers brands to create digital customer experiences that matter.
• As a Senior Site Reliability Engineer, you will design, implement, and maintain CI/CD pipelines, cloud infrastructure, and monitoring solutions.
• Apply expertise in tools such as ArgoCD and Kubernetes, and in cloud-native architecture.
• Innovate and design new infrastructure solutions for operational excellence at scale.
• Ensure engineering teams have the right infrastructure in place for rapid, safe, and reliable deployment.

Requirements

• BS in Computer Science or a comparable field of study, or equivalent practical experience.
• Experience working with one or more of: Go, Python, Ruby, PHP, Java, or JavaScript.
• Experience with Unix/Linux systems administration using the CLI.
• Fundamental understanding of TCP/UDP networking concepts.
• Solid oral and written communication skills.
• CI/CD Expertise: Extensive hands-on experience with CI/CD tools such as ArgoCD, Jenkins, CircleCI, or GitLab CI. Ability to design and implement pipelines that ensure rapid, reliable deployments.
• Kubernetes Guru: Strong understanding of and experience with Kubernetes, Helm, and container orchestration. Ability to scale and manage microservices in production.
• Cloud Mastery: Proficient in at least one major cloud provider: AWS, GCP, or Azure. Experience with multi-cloud or hybrid-cloud architecture is a plus.
• IaC Champion: Proficiency in Terraform, Ansible, or CloudFormation to manage infrastructure as code. Familiarity with GitOps workflows and version-controlled infrastructure.
• Monitoring & Observability: Strong experience with monitoring tools like Prometheus, Grafana, Datadog, ELK, or New Relic. Ability to build custom dashboards and alerting systems.
• Security-Focused: Deep understanding of security best practices in DevOps, including container security, CI/CD pipeline security, and cloud infrastructure hardening.
• Problem Solver: Excellent troubleshooting skills with the ability to diagnose issues across a variety of environments, from code to infrastructure.
• Collaboration Skills: Ability to work effectively in cross-functional teams, influencing peers and driving adoption of best practices across the organization.


