Lead Site Reliability Engineer

September 7

Apply Now
Logo of Curology

Curology

Making effective skincare accessible. Forbes, Fortune, Great Place to Work Best Company. We're hiring!

Skincare • Dermatology • Acne • Telemedicine • Anti-Aging

501 - 1000

💰 Funding Round on 2021-03

Description

• Architect and lead the delivery of high-quality and reliable solutions through creative problem-solving and technical expertise. • Enable Engineers on your team to improve the quality and impact of their work. • Evangelize reliability-as-a-feature through monitoring, service-level objectives, automation, everything-as-code, and testing. • Provide technical leadership and guidance to the SRE team. • Set the direction for SRE projects, aligning them with organizational goals. • Define and instrument Service-Level Objectives to ensure excellent customer experience. • Lead initiatives to improve system resilience and scalability. • Host postmortems to share learnings and improve reliability. • Participate in an on-call rotation to assist in finding resolution during incidents.

Requirements

• 7+ years of experience building infrastructure solutions in AWS using Infrastructure-as-Code technologies such as Terraform or CloudFormation. • 7+ years of experience working with Docker containers and related orchestration technologies (such as Kubernetes or ECS). • 7+ years of experience building and deploying CI/CD pipelines. • Experience with AWS, Docker, Kubernetes, Terraform, Python, PHP, and Laravel • Experience with architectural patterns of large, high-scale applications, such as well-designed APIs and database schemas. • Experience leading projects and initiatives that are wide in scale and complex in nature. • Experience working collaboratively in cross-functional teams with engineers in product and data groups. • Deep technical expertise; Writes, debugs, and refactors code while being mindful of tradeoffs, scalability, architecture, and code cleanliness. • Demonstrates mastery of their craft to solve problems in automation, infrastructure, and/or developer tooling. • Reliability & Quality; Experience leveraging observability tooling and practices such as SLOs to help engineering teams own the reliability and quality of the software they build. • Leadership - Define and deliver large, complex projects that may include coordination with non-technical stakeholders. Help define the SRE function and be a champion for it throughout the organization.

Benefits

• Competitive salary and equity packages • Company Performance Incentive Plan • Comprehensive benefits: medical, dental, and vision insurance for employees; flexible spending account; 401k; mental health & wellness programs • $75 WFH stipend (remote employees) • Home office setup stipend (remote employees) • Minimum Time Off policy (unlimited PTO, with at least 3 weeks off) for exempt employees • 11 company observed holidays • Additional holidays: Curology days off (1 per quarter), 1 annual floating holiday (employee’s choice), and Gratitude Week (employees take the full week of Thanksgiving off; business critical teams observe different days) • Paid parental leave • Employee donation matching program • Company-sponsored events • Free subscription to Curology or Agency

Apply Now

Similar Jobs

September 7

Cerbo EHR

11 - 50

Manage AWS infrastructure for a healthcare SaaS EHR system.

September 6

Filevine

201 - 500

Site Reliability Engineer for Filevine's automated systems and reliability solutions.

🇺🇸 United States – Remote

💰 $108M Series D on 2022-04

⏰ Full Time

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

September 5

AKASA

201 - 500

Integrate monitoring solutions for AKASA's healthcare AI applications.

🇺🇸 United States – Remote

💵 $145k - $200k / year

⏰ Full Time

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

August 31

Regrello

11 - 50

Oversee release management for Regrello's SaaS and customer environments.

🇺🇸 United States – Remote

💰 $20M Series A on 2022-05

⏰ Full Time

🟡 Mid-level

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

August 30

DriverReach

51 - 200

Site Reliability Engineer at DriverReach™, ensuring cloud application efficiency.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com