Site Reliability Engineer

October 18

Apply Now
Logo of Red Hat

Red Hat

cloud computing • hybrid cloud management • Linux • open source • virtualization

10,000+

💰 Corporate Round on 1999-03

Description

• Develop, scale, and operate OpenShift managed cloud services. • Contribute code to increase the scalability and reliability of services. • Contribute software tests and participate in peer review to increase code quality. • Help and develop peers’ capabilities through knowledge sharing, mentoring, and collaboration. • Participate in a regular on-call schedule, including occasional paid weekends and holidays. • Practice sustainable incident response and blameless postmortems. • Resolve customer issues escalated from the Red Hat Global Support team. • Work within a small agile team to develop and improve SRE software, support peers, plan and self-improve.

Requirements

• A bachelor's degree in Computer Science or a related technical field involving software or systems engineering. • Experience programming in at least one of these languages: Python, Golang, Java, C, C++ or another object-oriented language. • Experience working with public clouds such as AWS, GCP, or Azure. • Experience troubleshooting an as-a-service offering (SaaS, PaaS, etc.). • Have the ability to collaboratively troubleshoot and solve problems in a team setting. • Experience working with complex distributed systems. • Direct experience with Kubernetes or OpenShift is a plus. • Demonstrated ability to debug, optimize code and automate routine tasks. • Basic understanding of Unix/Linux operating systems. • 5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure. • 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus. • 3+ years of experience with enterprise configuration management software like Ansible by Red Hat, Puppet, or Chef. • 2+ years of experience programming with at least one object-oriented language; Golang, Java, or Python are preferred. • 2+ years of experience delivering a hosted service. • Demonstrated ability to quickly and accurately troubleshoot system issues. • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP. • Solid communications skills and experience working directly with and presenting to customers. • 1+ year(s) of experience with Kubernetes is a plus. • 1+ year(s) of experience with docker-based containers is a plus.

Apply Now

Similar Jobs

December 15, 2023

DevOps Engineer at Leonardo.Ai to enhance cloud infrastructure for a creative AI platform.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com