Staff Site Reliability Engineer

June 28

🇮🇳 India – Remote

💵 $120k - $140k / year

⏰ Full Time

🔴 Lead

👨🏻‍🔧 Site Reliability Engineer (SRE)

Apply Now
Logo of Everbridge

Everbridge

Keeping people safe and organizations running. Faster.

Critical Communications • IT Alerting • Incident Management • Clinical Communications • Secure Messaging

1001 - 5000

Description

• Design, build, and maintain scalable, reliable, and secure AWS and Kubernetes infrastructure to support applications and services. • Monitor system performance and reliability metrics, troubleshoot issues, and implement solutions to minimize downtime and performance degradation. • Collaborate with cross-functional teams to design and develop reliable, fault-tolerant systems.

Requirements

• 5+ years of experience in building and managing production grade infrastructure in AWS, Kubernetes and/or EKS. • In-depth knowledge of AWS services, including but not limited to EC2, S3, VPC, IAM, ECR, Route53, and API Gateway. • Familiarity with security best practices in cloud environments, including identity and access management (IAM), encryption, and compliance standards (e.g., GDPR). • Deep understanding of Kubernetes architecture, components, and ecosystem, including Docker, etcd, kube-proxy, and kube-controller-manager. • Proficiency in container orchestration concepts and IaC with hands-on experience in tools such as Helm, Terraform.

Benefits

• Participate in on-call rotation and respond to production incidents in a timely manner. • Participate in post-incident reviews and implement preventive measures to mitigate future incidents. • Implement and maintain security best practices throughout the infrastructure stack, ensuring compliance with industry standards and regulations. • Monitor and identify security vulnerabilities in infrastructure and container runtime and mitigate them

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com