Senior SRE Engineer

November 7

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Apply Now
Logo of Zeta Global

Zeta Global

CRM • Email Marketing • Display Advertising • Cross-Channel Marketing • Social Media Marketing

1001 - 5000

🔥 Funding within the last year

💰 Post-IPO Debt on 2024-09

Description

• Zeta Global is looking for a dynamic Senior SRE Engineer to join the team. • Implement and manage SLOs, SLIs, and error budgets. • Lead and promote postmortems, ensuring robust root cause analysis to drive continuous system improvement. • Analyze historical data to identify improvement areas. • Implement full observability across systems using modern tools like OpenTelemetry (OTEL), Honeycomb, New Relic, Datadog, or similar. • Reduce toil through runbook automation. • Record and track key MTTx metrics (MTTA, MTTR, MTTF, etc.). • Lead design sessions on capacity planning, reliability by design, automation, and alerting. • Collaborate with product teams to enhance system reliability. • Engage in strategic initiatives for capacity, reliability, and automation, ensuring alignment with business goals.

Requirements

• 7+ years of experience as an SRE. • 3+ years of software development experience, with a strong emphasis on automation. • Hands-on experience with Infrastructure as Code (IaC) tools. • Experience with distributed systems and microservices architecture (MSA). • Production experience with distributed tracing. • Proven skills in Python and Bash scripting. • Solid understanding of SLIs, SLOs, and error budgets. • Hands-on experience with CI/CD platforms (GitOps, GitLab, Jenkins, ArgoCD, etc.). • Expertise in incident management and root cause analysis. • Knowledge of modern deployment strategies (Canary, Blue-Green, etc.). • Familiarity with resiliency patterns (circuit breakers, retry mechanisms, load balancing, etc.). • Experience with SQL and NoSQL databases and understanding of distributed systems. • Proficiency in statistical analysis applied to metrics. • Experience with high-performance, low-latency systems. • Proven experience in cloud cost optimization strategies. • Experience with Kafka or other distributed messaging systems. • Strong understanding of security and compliance standards within DevOps/SRE environments.

Benefits

• Unlimited PTO • Excellent medical, dental, and vision coverage • Employee Equity and Stock Purchase Plan • Employee Discounts, Virtual Wellness Classes, and Pet Insurance And more!!

Apply Now

Similar Jobs

November 7

Worldpay

5001 - 10000

DevOps Engineer for Worldpay, enhancing platform resilience and automation.

🇺🇸 United States – Remote

💵 $130k - $218.4k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 6

Senior SRE role at Provision IAM focusing on cloud infrastructure and automation.

🇺🇸 United States – Remote

💵 $95k - $125k / year

💰 $925k Seed Round on 2022-10

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 6

Senior DevOps Engineer for healthcare technology firm improving software delivery and deployments.

🇺🇸 United States – Remote

💰 Private Equity Round on 2018-07

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 6

Oddball

51 - 200

Senior DevOps Engineer for a Federal program improving lives through quality software.

🇺🇸 United States – Remote

💵 $120k - $160k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

November 6

Lead SRE projects to optimize Centene's platform infrastructure.

🇺🇸 United States – Remote

💵 $98.9k - $183.1k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com