6 days ago
AWS
Backbone
Cloud
Distributed Systems
Docker
Grafana
JavaScript
Jenkins
Kubernetes
Prometheus
Python
SQL
Terraform
Go
• Design, implement, and maintain cloud infrastructure on AWS and Google Cloud platforms.
• Ensure the scalability, performance, and reliability of our Kubernetes-based distributed database systems.
• Collaborate with developers to write efficient, production-grade code in Golang to automate infrastructure management and improve system operations.
• Optimize and automate CI/CD pipelines, deployment processes, and monitoring systems to support our production environment.
• Develop strategies for disaster recovery, high availability, and fault tolerance.
• Proactively identify system bottlenecks, troubleshoot, and resolve issues across the stack (network, OS, cloud infrastructure).
• Implement monitoring, logging, and alerting systems to ensure visibility into system health and performance.
• Participate in on-call rotations to support critical production systems and respond to incidents.
• Collaborate with cross-functional teams to improve overall system reliability and scalability.
• Collaborate with the Customer Success team to resolve customer issues.
• 5+ years of proven experience as an SRE or DevOps Engineer in a cloud-native environment (AWS or GCP).
• Proficiency with Kubernetes in managing large-scale, distributed systems.
• Solid understanding of networking, security practices, and troubleshooting methods.
• Understanding of Linux internals (processes, environment variables, etc.).
• Familiarity with containerization technologies (e.g., Docker).
• Knowledge of CI/CD practices and tools (Jenkins, CircleCI, etc.).
• Familiarity with alerting, monitoring, and observability tools (e.g., Prometheus, Grafana, ELK stack).
• Strong troubleshooting and problem-solving skills, with the ability to address complex infrastructure issues.
• Excellent communication and collaboration skills with a focus on continuous improvement and operational excellence.
• Strong ability to self-organize and to work independently as part of a remote team.
• Knowledge of version control systems, particularly Git.
• Familiarity with programming languages such as Golang or Python.
November 18
Join Redcare Pharmacy as a Senior DevOps Engineer to enhance infrastructure for high-traffic operations. Utilize Kubernetes, OpenStack, and GCP to maintain scalable systems.
November 13
Site Reliability Engineer at Vercel enhancing compute infrastructure for web solutions.
🇩🇪 Germany – Remote
💰 $150M Series D on 2021-11
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
November 8
DevOps Engineer role at social impact startup holi improving cloud infrastructure.
October 31
Intermediate DevOps Engineer focusing on Microsoft Azure and Cloud technologies.
October 31
DevOps Engineer needed at LimeSurvey to enhance infrastructure and operations.