Senior Site Reliability Engineer

August 14

🇺🇸 United States – Remote

💵 $110k - $177k / year

⏰ Full Time

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

Platform Science

Making it easy for enterprise trucking fleets to develop, deploy, and manage mobile devices and applications.

201 - 500

Description

• Develop and enhance the Continuous Integration/Continuous Deployment (CI/CD) pipelines, along with refining release management processes and associated toolsets
• Maintain Helm charts to streamline application deployment and management
• Establish standardized observability solutions to empower development teams in efficiently managing their applications
• Lead the effort in promoting and prioritizing reliability, driving achievement of uptime goals, and mentoring colleagues in SRE best practices
• Conduct comprehensive Production Readiness Reviews, working with teams to identify and establish Service Level Indicators and Service Level Objectives (SLIs/SLOs), and ensure high-quality and dependable services
• Design and develop software solutions to address operational challenges effectively to improve system stability and reliability
• Fulfill on-call duties, providing expert support to development teams for mission-critical applications in production environments
• Improve the resiliency of applications and systems using chaos engineering

Requirements

• 5+ years of hands-on experience in SRE or Platform Engineering roles
• Demonstrated expertise (2+ years) with automation technologies like Jenkins, ArgoCD, or similar
• Experience with Kubernetes (2+ years), Helm, and Docker within production environments
• Proficiency with current software development lifecycle (SDLC) concepts and best practices, CI/CD pipelines, and test-driven development
• Experience with AWS, encompassing proficiency in EKS, IAM, autoscaling, networking, and load balancing/request routing in a production environment
• Proficient in Python, Bash, Node.js, and/or Go
• Proficient with distributed tracing methodologies and observability tools such as Prometheus, ELK, or Datadog
• Strong emphasis on documentation and fostering knowledge-sharing practices within the team and organization
• Track record of successfully training and mentoring engineers
• Proven expertise in optimizing performance and managing costs within cloud environments
• Sound understanding of SLI/SLO concepts and adherence to SRE best practices
• Bachelor's degree in Computer Science or a related field

Benefits

• Medical, dental, and vision insurance
• Short-term and long-term disability insurance
• AD&D and life insurance
• 401k plan
• Paid vacation, sick leave, and holidays
• Six weeks of paid parental leave

Similar Jobs

August 14

Signify Health

1001 - 5000

Improve system stability and scalability while collaborating across product teams.

🇺🇸 United States – Remote

💵 $72.1k - $125.6k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

August 14

hims & hers

201 - 500

Build reliable web experiences to enhance users' health through telehealth innovations.

🇺🇸 United States – Remote

💵 $150k - $175k / year

⏰ Full Time

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

August 14

hims & hers

201 - 500

Build a reliable web experience while revolutionizing telehealth solutions.

🇺🇸 United States – Remote

💵 $103k - $117k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

August 13

Elevate infrastructure resilience and optimize system performance for content-creator community.

🇺🇸 United States – Remote

💵 $161k - $180k / year

⏰ Full Time

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)
