Principal Site Reliability Engineer

October 22

🇺🇸 United States – Remote

💵 $204k - $275k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

Apply Now
Logo of SimSpace

SimSpace

Cyber Team Training • Cyber Ranges • Cyber Testing • Cyber Exercises • Cybersecurity

201 - 500

Description

• Develop and implement strategies for the monitoring and alerting of systems health, performance, and security • Develop and implement strategies for incident management, problem management, and change management • Create and maintain automation tools and scripts for configuration management, deployment, and maintenance of cloud-based infrastructure • Conduct performance and capacity planning to ensure the systems are operating at optimal levels • Implement and manage the disaster recovery plan, ensuring that the systems are backed up and can be recovered in case of an outage • Collaborate with development and operations teams to ensure that application and infrastructure changes are properly tested, deployed, and maintained • Evaluate new technologies and tools, and make recommendations for their adoption based on their impact on system performance, reliability, and scalability • Develop and maintain documentation of system configurations, processes, and procedures. • Providing technical leadership and mentoring to other engineers on the team

Requirements

• In depth experience in software development and/or infrastructure engineering, with a focus on site reliability and/or system administration • Strong experience in cloud computing, particularly with Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP) • Must have extensive experience in containerization technologies such as Docker and Kubernetes • Strong experience in one of the scripting languages such as Python, Perl, or Ruby. • Proficiency in Terraform or Cloud Formation for managing infrastructure using IAC principles • Proficiency in one of the configuration management tools such as Puppet, Chef, or Ansible • Deep understanding of networking concepts such as TCP/IP, DNS, load balancing, and firewalls

Benefits

• Comprehensive benefits package that start on day one • 401k match with immediate vesting • Flex time, the time off you need when you need it • Equity options at hire and potential for additional based on performance • Generous employee referral bonus program • Peloton Interactive Wellness Program • LinkedIn Learning Membership • Monthly reimbursement for meaningful connections with other SimSpacers

Apply Now

Similar Jobs

October 17

interface.ai

51 - 200

Lead DevOps at interface.ai, an AI provider for financial institutions.

🇺🇸 United States – Remote

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

October 17

Join Fieldwire as Staff Software Engineer, SRE, enhancing construction management software.

🇺🇸 United States – Remote

💵 $195k - $220k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

October 15

Vultr

51 - 200

Drive automation as Staff Site Reliability Engineer at Vultr's cloud platform.

🇺🇸 United States – Remote

💵 $120k - $135k / year

⏰ Full Time

🔴 Lead

⛑ DevOps & Site Reliability Engineer (SRE)

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com