Site Reliability Engineer

February 8

Description

• Working proactively with engineering teams to help them set SLOs and implement best practices for logging and telemetry collection
• Designing, implementing, and maintaining the tools and systems that support service reliability, monitoring, and alerting
• Participating in a 24x7 on-call rotation supporting the health of our services
• Driving the incident management process and supporting a blameless post-mortem culture
• Participating in application design consulting and capacity planning
• Defining and formalizing SRE practices and helping guide the overall reliability engineering direction
• Providing mentorship, both formally and informally, to engineers at ONE
• Continuously optimizing systems and workflows by improving architecture, infrastructure, automation, CI/CD, and observability
• Combining software and systems knowledge to engineer high-volume distributed systems in a reliable, scalable, and fault-tolerant manner

Requirements

• 5+ years of relevant industry experience with a focus on distributed cloud-native systems design, observability, operation, maintenance, and troubleshooting
• 5+ years of operational experience with an observability platform like Datadog, Splunk, Prometheus/Grafana, or AppDynamics
• Fluency in one or more programming languages (e.g. Python, TypeScript, Go)
• A strong conviction in software development best practices, including version control, automated testing, and continuous integration and delivery
• You're self-motivated, inquisitive, and always looking to learn new technologies
• You're a great teammate who communicates clearly and transparently
• The Triple H Factor: Humble, Hungry and Honest
• An act-like-an-owner mentality. We have a bias toward taking action.

Benefits

• Competitive cash
• Benefits effective on day one
• Early access to a high potential, high growth fintech
• Generous stock option packages in an early-stage startup
• Remote friendly (anywhere in the US) and office friendly - you pick the schedule
• Flexible time off programs - vacation, sick, paid parental leave, and paid caregiver leave
• 401(k) plan with match
