Senior SRE Engineer

November 7


Zeta Global

CRM • Email Marketing • Display Advertising • Cross-Channel Marketing • Social Media Marketing

1001 - 5000 employees

Founded 2007

☁️ SaaS

🤖 Artificial Intelligence

🤝 B2B

🔥 Funding within the last year

💰 Post-IPO Debt on 2024-09

Description

• Zeta Global is looking for a dynamic Senior SRE Engineer to join the team.
• Implement and manage SLOs, SLIs, and error budgets.
• Lead and promote postmortems, ensuring robust root cause analysis to drive continuous system improvement.
• Analyze historical data to identify improvement areas.
• Implement full observability across systems using modern tools like OpenTelemetry (OTEL), Honeycomb, New Relic, Datadog, or similar.
• Reduce toil through runbook automation.
• Record and track key MTTx metrics (MTTA, MTTR, MTTF, etc.).
• Lead design sessions on capacity planning, reliability by design, automation, and alerting.
• Collaborate with product teams to enhance system reliability.
• Engage in strategic initiatives for capacity, reliability, and automation, ensuring alignment with business goals.

Requirements

• 7+ years of experience as an SRE.
• 3+ years of software development experience, with a strong emphasis on automation.
• Hands-on experience with Infrastructure as Code (IaC) tools.
• Experience with distributed systems and microservices architecture (MSA).
• Production experience with distributed tracing.
• Proven skills in Python and Bash scripting.
• Solid understanding of SLIs, SLOs, and error budgets.
• Hands-on experience with CI/CD platforms (GitOps, GitLab, Jenkins, ArgoCD, etc.).
• Expertise in incident management and root cause analysis.
• Knowledge of modern deployment strategies (Canary, Blue-Green, etc.).
• Familiarity with resiliency patterns (circuit breakers, retry mechanisms, load balancing, etc.).
• Experience with SQL and NoSQL databases and understanding of distributed systems.
• Proficiency in statistical analysis applied to metrics.
• Experience with high-performance, low-latency systems.
• Proven experience in cloud cost optimization strategies.
• Experience with Kafka or other distributed messaging systems.
• Strong understanding of security and compliance standards within DevOps/SRE environments.

Benefits

• Unlimited PTO
• Excellent medical, dental, and vision coverage
• Employee Equity and Stock Purchase Plan
• Employee Discounts, Virtual Wellness Classes, and Pet Insurance
• And more!


