Site Reliability Engineer

4 days ago

Apply Now
Logo of Red Hat

Red Hat

cloud computing • hybrid cloud management • Linux • open source • virtualization

10,000+

💰 Corporate Round on 1999-03

Description

• Develop, scale, and operate OpenShift managed cloud services. • Contribute code to increase the scalability and reliability of services. • Contribute software tests and participate in peer review to increase code quality. • Help and develop peers’ capabilities through knowledge sharing, mentoring, and collaboration. • Participate in a regular on-call schedule, including occasional paid weekends and holidays. • Practice sustainable incident response and blameless postmortems. • Resolve customer issues escalated from the Red Hat Global Support team. • Work within a small agile team to develop and improve SRE software, support peers, plan and self-improve.

Requirements

• A bachelor's degree in Computer Science or a related technical field involving software or systems engineering. • Experience programming in at least one of these languages: Python, Golang, Java, C, C++ or another object-oriented language. • Experience working with public clouds such as AWS, GCP, or Azure. • Experience troubleshooting an as-a-service offering (SaaS, PaaS, etc.). • Have the ability to collaboratively troubleshoot and solve problems in a team setting. • Experience working with complex distributed systems. • Direct experience with Kubernetes or OpenShift is a plus. • Demonstrated ability to debug, optimize code and automate routine tasks. • Basic understanding of Unix/Linux operating systems. • 5+ years of experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Amazon Web Services (AWS), Google Compute Engine (GCE), or Microsoft Azure. • 3+ years of experience with enterprise systems monitoring; knowledge of Prometheus is a plus. • 3+ years of experience with enterprise configuration management software like Ansible by Red Hat, Puppet, or Chef. • 2+ years of experience programming with at least one object-oriented language; Golang, Java, or Python are preferred. • 2+ years of experience delivering a hosted service. • Demonstrated ability to quickly and accurately troubleshoot system issues. • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP. • Solid communications skills and experience working directly with and presenting to customers. • 1+ year(s) of experience with Kubernetes is a plus. • 1+ year(s) of experience with docker-based containers is a plus.

Apply Now

Similar Jobs

September 29

Avetta

501 - 1000

DevOps Engineer at Avetta enhancing mobile and cloud infrastructure.

🇦🇺 Australia – Remote

💰 Private Equity Round on 2019-02

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

July 26

Implement cloud architecture and support execution of cloud implementation strategy.

July 25

Implements technical cloud strategies and provides support for IT infrastructure.

December 15, 2023

DevOps Engineer at Leonardo.Ai to enhance cloud infrastructure for a creative AI platform.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com