Site Reliability Engineer

October 18

Apply Now

Description

• Participate in agile sprints and associated ceremonies • Drive innovation and platform evolution • Scale cloud infrastructure based on Kubernetes • Provide reliable, predictable deployment and maintenance of distributed systems • Adhere to security best practices • Write and design automation, monitoring, diagnosing, and debugging tooling • Coordinate and participate in production support and on-call rotations • Conduct incident management and contribute to retrospectives/postmortems as needed • Work cross-functionally with cloud operations, development, and product teams

Requirements

• Extensive knowledge working in a public cloud and/or hosted datacenter environment (Azure preferred) • Experience with highly available and scalable systems • Familiarity with Google SRE concepts • Ability to manage Windows Servers, AD, and .NET applications (C#.NET/ASP) • Experience with IIS and MS SQL configuration and support • Familiarity with Linux Server stacks (Ubuntu/Debian distributions preferred) • Basic to intermediate knowledge of networking (Subnetting, CIDR) • Experience with one cloud provisioning platform (HashiCorp Terraform preferred) • Experience with at least one configuration management platform (Chef preferred) • Familiarity with containerization/clustering technologies (e.g Docker, Azure Kubernetes) • Familiarity with alerting and monitoring tools (Prometheus/Grafana or ELK/EFK preferred) • Working knowledge of CI/CD • Experience writing technical documentation and SOP’s for internal stakeholders • Willingness to collaborate with other teams, providing root cause analysis and problem analysis as needed • A Bachelor’s degree in Computer Engineering or related field (or equivalent experience) • Proficiency in at least one scripting language (PowerShell, Python, Ruby, etc)

Benefits

• Comprehensive Health/Vision/Dental/Life Insurance • 401k Retirement Savings Plan with a company match up to 4% • Annual performance-based bonus • HealthJoy healthcare concierge service • Enhanced leave for expecting parents: 20 weeks 100% paid for primary leave, 10 weeks 100% paid for secondary leave • Flexible time off policy • Multiple company wellness days • Free access to the Healthy Minds app

Apply Now

Similar Jobs

October 12

Support implementation of SRE practices for enhanced system reliability.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com