Site Reliability Engineer - Europe

6 days ago

🇩🇪 Germany – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Apply Now
Logo of ArangoDB

ArangoDB

NoSQL • Document Store • GraphDB • Key Value Store • Multi-Model

Description

• Design, implement, and maintain Cloud infrastructure on AWS and Google Cloud platforms. • Ensure the scalability, performance, and reliability of our Kubernetes-based distributed database systems. • Collaborate with developers to write efficient, production-grade code in Golang to automate infrastructure management and improve system operations. • Optimize and automate CI/CD pipelines, deployment processes, and monitoring systems to support our production environment. • Develop strategies for disaster recovery, high availability, and fault tolerance. • Proactively identify system bottlenecks, troubleshoot, and resolve issues across the stack (network, OS, cloud infrastructure). • Implement monitoring, logging, and alerting systems to ensure visibility into system health and performance. • Participate in On-call rotations to support critical production systems and respond to incidents. • Collaborate with cross-functional teams to improve overall system reliability and scalability. • Collaborate with the Customer Success team to resolve customer issues.

Requirements

• 5+ years of proven experience as an SRE or DevOps Engineer in a Cloud-native environment (AWS or GPS). • Proficiency with Kubernetes in managing large-scale, distributed systems. • Solid understanding of networking, security practices, and troubleshooting methods. • Understanding of Linux internals (processes, environment variables etc.). • Familiarity with containerization technologies (e.g., Docker). • Knowledge of CI/CD practices and tools (Jenkins, CircleCI, etc.). • Familiarity with alerting, monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack). • Strong troubleshooting and problem-solving skills, with the ability to address complex infrastructure issues. • Excellent communication and collaboration skills with a focus on continuous improvement and operational excellence. • Strong ability to self-organize and to work independently as part of a remote team. • Knowledge of version control systems, particularly Git. • Familiarity with programming languages such as Golang or Python.

Apply Now

Similar Jobs

November 18

Join Redcare Pharmacy as a Senior DevOps Engineer to enhance infrastructure for high-traffic operations. Utilize Kubernetes, OpenStack, and GCP to maintain scalable systems.

🇩🇪 Germany – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 13

Site Reliability Engineer at Vercel enhancing compute infrastructure for web solutions.

🇩🇪 Germany – Remote

💰 $150M Series D on 2021-11

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

October 31

DevOps Engineer needed at LimeSurvey to enhance infrastructure and operations.

🇩🇪 Germany – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com