Manager - Engineering

October 9

Description

• Lead and mentor a team of software engineers, fostering a culture of collaboration, continuous learning, and high performance.
• Stay hands-on in designing, implementing, supporting, and monitoring cloud infrastructure, ensuring world-class reliability for high-stakes environments.
• Own and evolve best practices and standards for infrastructure-as-code across multiple product lines, evangelizing these practices within the team and broader engineering organization.
• Use data-driven insights to enhance service resiliency, minimize downtime, and optimize system performance.
• Oversee and participate in incident response and resolution to ensure minimal downtime and impact on critical services.
• Develop and maintain robust monitoring and alerting systems to guarantee high availability and proactively address performance concerns.
• Manage infrastructure scaling to handle varying loads, ensuring seamless operations across multiple regions.
• Automate operational processes to improve efficiency, reduce manual interventions, and optimize workflows.
• Collaborate with the Information Security and Compliance team to ensure all resources adhere to DoD security specifications.
• Drive team alignment on key metrics, such as SLOs, and lead efforts to meet or exceed those targets.
• Champion compliance efforts, including analyzing applications for alignment with CC SRG and NIST 800-53 standards, and implementing risk mitigation strategies.

Requirements

• Proven leadership experience, with a demonstrated ability to manage, coach, and develop a high-performing team.
• Expert-level knowledge of Linux operating systems and cloud infrastructure, with deep experience in AWS.
• Strong hands-on experience with infrastructure-as-code tools, including Terraform, Ansible, and Packer.
• Familiarity with Docker, containers, and their orchestration in large-scale environments.
• Proficiency in programming/scripting languages (Python, Go, Java, C#, etc.).
• In-depth knowledge of system design, performance tuning, and troubleshooting in cloud environments.
• Experience with monitoring and logging tools such as Prometheus, Grafana, and Datadog.
• Strong understanding of incident management and disaster recovery practices, with the ability to guide the team through critical moments.

Benefits

• A role in shaping the future of sports and a career that grows as the company grows.
• An exceptional culture of high achievement and teamwork.
• Supportive and humble colleagues who are some of the top problem solvers and innovators in the game.
• Financial security through competitive compensation and incentives.
• A comprehensive benefits plan, including medical, dental, vision, disability, life insurance, and a 401(k) match.
• Additional educational opportunities via Range can be used for courses, conferences, and other options.
• Unlimited paid time off.
• Company equity.
• 100% remote-optional work setting.
