Site Reliability Engineer

April 10

Apply Now

Description

• Work with large-scale systems (handling millions of requests per second, serving millions of users, across multiple cloud providers). • Develop solutions to enhance performance, availability, security, and cost-effectiveness. • Keep us up, keep us fast, and keep our dev teams productive ensuring that every peer release improves performance across the spectrum including quality, security, uptime, speed-to-deliver, threat detection, and customer engagement. • Source improvement ideas, priority and capabilities from customers, the internal community, new and existing system metrics. Make decisions rapidly. • Be creative and desire an environment where you can directly create value and be a force to improve the experience for our customers.

Requirements

• Strong programming skills in one or more of the following languages: Python, JavaScript, Go. • Background in software engineering with expertise in backend development within Kubernetes-based systems. • Hands-on experience in development and orchestration within high-scale, high-uptime, and high-reliability environments. • Minimum of six years of hands-on experience in related roles (engineering, DevOps, SRE). • Familiarity with distributed systems, including queue-first architectures and sharding. • Demonstrated engineering expertise, including gathering requirements, problem-solving, and making recommendations. • Preferred: Familiarity with security frameworks, attack vectors, botnets, and impact analysis.

Benefits

• Fully remote position with flexible working hours • An inspiring team of colleagues spread all over the world • Pleasant, modern development and deployment workflows: ship early, ship often • High impact: lots of users, happy customers, high growth, and cutting edge R&D • Flat organization, direct interaction with customer teams

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com