Senior Staff - Site Reliability Engineering, Observability

5 hours ago

🇺🇸 United States – Remote

💵 $198k - $270k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

Apply Now
Logo of SentinelOne

SentinelOne

next-generation endpoint protection • endpoint detection & response • threat and malware prevention • exploit prevention • cybersecurity

Description

• About Us: SentinelOne is defining the future of cybersecurity through our XDR platform that automatically prevents, detects, and responds to threats in real-time. • We are seeking to hire a Senior Staff Engineer to join our Site Reliability Engineering (SRE) Team at SentinelOne. • This role can be 100% remote for individuals based in the US, or hybrid if local to a corporate office location. • As a Senior Staff SRE, you will architect and lead the implementation of advanced observability, automated triage, and self-healing capabilities within our microservices-based SaaS environment. • You will be instrumental in driving our organization’s evolution towards proactive, scalable incident management by enabling smart alert correlation, automated root cause analysis, and autonomous remediation systems. • Additionally, you will define and implement Service Level Objectives (SLOs) that align with business goals, ensuring our systems meet reliability standards and exceed customer expectations.

Requirements

• Extensive SRE Experience: Proven experience in architecting and implementing SRE solutions at scale within a microservices or distributed systems environment. • 10+ years of progressive professional experience, with 5+ years of recent experience supporting enterprise SaaS environments (or equivalent combination of education, experience, and certifications). • Technical Expertise: Deep knowledge of incident management, alert correlation, automated triage, self-healing strategies, and SLO frameworks. Strong understanding of observability platforms, including monitoring, logging, and tracing solutions. • Programming & Scripting: Proficient in one or more programming languages (e.g., Python, Go, Java) with experience in automation and scripting for incident management workflows. • Machine Learning & Data Analysis: Experience with machine learning, anomaly detection, or data analytics techniques for real-time alert correlation and triage systems. • Cloud Infrastructure: Expertise in cloud platforms (e.g., AWS, GCP, Azure) and container orchestration (e.g., Kubernetes), with experience in infrastructure-as-code (e.g., Terraform). • Problem-Solving & Decision-Making: Ability to make critical architectural decisions with a focus on business impact, reliability, and system performance.

Benefits

• Medical, Vision, Dental, 401(k), Commuter, Health and Dependent FSA • Unlimited PTO • Industry leading gender-neutral parental leave • Paid Company Holidays • Paid Sick Time • Employee stock purchase program • Disability and life insurance • Employee assistance program • Gym membership reimbursement • Cell phone reimbursement

Apply Now

Similar Jobs

7 hours ago

Senior Site Reliability Engineer at Bitwarden, focusing on operational success through cloud technology. Owner of the cloud infrastructure across multiple cloud providers.

🇺🇸 United States – Remote

💵 $140k - $160k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

7 hours ago

Drive CI/CD tool adoption and automation strategies at McDonald's Global Technology. Lead teams in enhancing service reliability and performance.

🇺🇸 United States – Remote

💵 $129.8k - $165.5k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

7 hours ago

Join Rubrik as a Senior Site Reliability Engineer to deploy security solutions and manage technologies.

🇺🇸 United States – Remote

💵 $172k - $258k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

8 hours ago

Join Degreed as a DevOps Engineer, focusing on cloud migration and infrastructure development.

🇺🇸 United States – Remote

💵 $120k - $140k / year

💰 Venture Round on 2021-07

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

8 hours ago

Join Netlify's SRE team to enhance infrastructure scalability and reliability for millions of users.

🇺🇸 United States – Remote

💵 $136k - $184k / year

💰 $105M Series D on 2021-11

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com