Site Reliability Engineer

November 27

🇺🇸 United States – Remote

💵 $140k - $180k / year

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

Apply Now

Description

• Responsible for building and operating core infrastructure including bare metal provisioning, telemetry, storage, and orchestration. • Own the care and provision of thousands of GPU servers and related support infrastructure. • Define company’s culture and ensure mission success. • Design and roll out new platforms to minimize incidents and enable features. • Collaborate with network engineering, software development, and customer support. • Participate in the SRE on-call rotation (1 week on, 5+ weeks off).

Requirements

• 8+ years working with Linux as a server / hosting platform, extra points for Ubuntu experience. • 5+ years experience with AWS. • 2+ years experience with Kubernetes and strong container fundamentals. • 2+ years experience with Terraform and Ansible • 2+ years with network attached storage management (via NFS, ceph, or other protocols). Extra points for experience with VAST storage systems. • Experience working in a Slack-first, asynchronous remote work environment. • Experience with monitoring systems (Prometheus, ELK stack). • Familiarity with the gitops workflow. • Software development experience using Python, Go, bash, or other languages for the purposes of automation & connecting systems & APIs together. • Deep networking fundamentals, extra points for experience with datacenter level networks, 400Gb ethernet, and Infiniband. • Experience architecting, building, and delivering complex systems from 0 to 1. • Adept at balancing pragmatic development and ideal architectures. Effective at navigating tradeoffs between design, risk, cost, and outcomes. • Comfortable with navigating ambiguity. • Strong written and oral communication.

Benefits

• 5% 401k match • Comprehensive health insurance with 100% of premiums covered by Voltage Park

Apply Now

Similar Jobs

November 23

Lead the DevOps team at Everi for enterprise solutions in a remote role.

🇺🇸 United States – Remote

💵 $130.7k - $155.2k / year

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

November 19

Join Coalfire as a Technical Manager, leading cloud solutions to enhance cybersecurity operations and client support.

🇺🇸 United States – Remote

💵 $94k - $163k / year

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

November 19

Join IDEXX as a Site Reliability Engineer to support cloud enterprise systems with reliable infrastructure solutions.

🇺🇸 United States – Remote

💰 Seed Round on 1984-01

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

November 19

Join VetsEZ as a remote Expert DevOps Engineer. Support Department of Veterans Affairs projects and healthcare technology innovation.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

November 13

Remote role for a DevOps Engineer supporting VA healthcare technology efforts.

🇺🇸 United States – Remote

⏰ Full Time

🟠 Senior

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com