September 12
🇺🇸 United States – Remote
💵 $150k - $180k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
• Become part of an iconic brand that is set to revolutionize the electric pick-up truck & rugged SUV marketplace by achieving the following: • Contribute to the design, implementation, and maintenance of the overall cloud infrastructure platform using modern IaC (Infrastructure as Code) practices. • Work closely with software development and systems integration teams to build end-to-end solutions. • Design and build infrastructure utilizing container orchestration such as EKS/K8S • Provide and participate in an Incident Response process for establishing disaster recovery practices. • Ensure high uptime of critical systems. • Design and implement an availability reporting framework working with engineering teams to develop SLO and SLI measurements and targets • Participate in scaling and performance testing of critical components and services • Design and implement cloud infrastructure components, ensuring high availability, reliability, scalability, and performance. • Implement monitoring solutions to proactively identify and address potential issues. • Implement logging solutions to facilitate efficient troubleshooting and analysis. • Collaborate with security teams to ensure the platform meets industry standards and compliance requirements. • Collaborate with cross-functional teams, including product managers, developers, and QA engineers to ensure robust and reliable systems.
• Bachelor's degree in computer science, information technology, or related field or equivalent work experience. • 8+ years of hands-on experience as a Site Reliability Engineer. • Proficient in building automation using languages such as Python, Shell, Ruby, and others. • Strong experience with containerization technologies (Docker, Kubernetes). • Expertise in configuration management tools (e.g., Ansible, Chef, Puppet). • Solid understanding of CI/CD concepts and tools (GitHub Actions, GitLab Pipelines, Harness.io, ArgoCD). • Multiple years of experience working with cloud platforms such as AWS, Azure, or Google Cloud. • Experience with monitoring and alerting tools such as Datadog, New Relic, SignalFX, Prometheus, AWS CloudWatch etc. • Experience with logging solutions AWS CloudWatch, ELK, Splunk or equivalent • Experience with infrastructure as code (Terraform, Pulumi, or CDK). • Excellent problem-solving and troubleshooting skills. When a problem occurs, you run towards it not away. • Effective communication and collaboration skills. You treat colleagues with respect. You have a desire for clean implementations but are also humble in discussing alternative solutions and options. • A teaching and coaching approach to guiding engineers and teams in approaches.
• Competitive insurance including: • Medical, dental, vision and income protection plans • 401(k) program with: • An employer match and immediate vesting • Generous Paid Time Off including: • 20 days planned PTO, as accrued • 40 hours of unplanned PTO and 14 company or floating holidays, annually • Up to 16 weeks of paid parental leave for biological and adoptive parents of all genders • Paid leave for circumstances related to bereavement, jury duty, voting time, or military leave
Apply NowSeptember 12
1001 - 5000
Maintain GitLab's user-facing services and production systems as Site Reliability Engineer.
🇺🇸 United States – Remote
💰 Secondary Market on 2020-11
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
September 11
1001 - 5000
SRE focused on automating GitLab environments and keeping production systems running.
🇺🇸 United States – Remote
💵 $103.6k - $222k / year
💰 Secondary Market on 2020-11
⏰ Full Time
🟡 Mid-level
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
September 6
201 - 500
Site Reliability Engineer for Filevine's automated systems and reliability solutions.
🇺🇸 United States – Remote
💰 $108M Series D on 2022-04
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🗽 H1B Visa Sponsor
September 5
201 - 500
Integrate monitoring solutions for AKASA's healthcare AI applications.
🇺🇸 United States – Remote
💵 $145k - $200k / year
⏰ Full Time
🟠 Senior
⛑ DevOps & Site Reliability Engineer (SRE)
🗽 H1B Visa Sponsor