Senior Site Reliability Engineer

September 12

🇺🇸 United States – Remote

💵 $150k - $180k / year

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Description

Become part of an iconic brand that is set to revolutionize the electric pick-up truck & rugged SUV marketplace by achieving the following:
• Contribute to the design, implementation, and maintenance of the overall cloud infrastructure platform using modern IaC (Infrastructure as Code) practices.
• Work closely with software development and systems integration teams to build end-to-end solutions.
• Design and build infrastructure utilizing container orchestration such as EKS/K8s.
• Provide and participate in an Incident Response process for establishing disaster recovery practices.
• Ensure high uptime of critical systems.
• Design and implement an availability reporting framework, working with engineering teams to develop SLO and SLI measurements and targets.
• Participate in scaling and performance testing of critical components and services.
• Design and implement cloud infrastructure components, ensuring high availability, reliability, scalability, and performance.
• Implement monitoring solutions to proactively identify and address potential issues.
• Implement logging solutions to facilitate efficient troubleshooting and analysis.
• Collaborate with security teams to ensure the platform meets industry standards and compliance requirements.
• Collaborate with cross-functional teams, including product managers, developers, and QA engineers, to ensure robust and reliable systems.

Requirements

• Bachelor's degree in computer science, information technology, or a related field, or equivalent work experience.
• 8+ years of hands-on experience as a Site Reliability Engineer.
• Proficient in building automation using languages such as Python, Shell, Ruby, and others.
• Strong experience with containerization technologies (Docker, Kubernetes).
• Expertise in configuration management tools (e.g., Ansible, Chef, Puppet).
• Solid understanding of CI/CD concepts and tools (GitHub Actions, GitLab Pipelines, Harness.io, ArgoCD).
• Multiple years of experience working with cloud platforms such as AWS, Azure, or Google Cloud.
• Experience with monitoring and alerting tools such as Datadog, New Relic, SignalFx, Prometheus, AWS CloudWatch, etc.
• Experience with logging solutions such as AWS CloudWatch, ELK, Splunk, or equivalent.
• Experience with infrastructure as code (Terraform, Pulumi, or CDK).
• Excellent problem-solving and troubleshooting skills. When a problem occurs, you run towards it, not away.
• Effective communication and collaboration skills. You treat colleagues with respect. You have a desire for clean implementations but are also humble in discussing alternative solutions and options.
• A teaching and coaching approach to guiding engineers and teams.

Benefits

• Competitive insurance including:
  • Medical, dental, vision and income protection plans
• 401(k) program with:
  • An employer match and immediate vesting
• Generous Paid Time Off including:
  • 20 days planned PTO, as accrued
  • 40 hours of unplanned PTO and 14 company or floating holidays, annually
  • Up to 16 weeks of paid parental leave for biological and adoptive parents of all genders
  • Paid leave for circumstances related to bereavement, jury duty, voting time, or military leave
