Senior Site Reliability Engineer

20 hours ago

🇮🇪 Ireland – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability (SRE)

Apply Now
Logo of Oomnitza

Oomnitza

Enterprise Software • ITAM • IT Asset Management • IT Service Management • MDM

51 - 200

💰 $20M Series B on 2021-08

Description

• Oomnitza offers the industry’s most versatile Enterprise Technology Management platform that orchestrates and automates key business processes for IT. • Our SaaS solution, with agentless integrations, best practices and low-code workflows, enables enterprises to leverage their existing infrastructure systems and automate processes such as offboarding, onboarding, audit readiness, refresh forecasting and more, thereby reducing reliance on error-prone manual tasks and tickets. • We help some of the most well-known and innovative companies to improve efficiency, expedite audits, mitigate cyber risk and eliminate redundant IT spend. • At Oomnitza, we’re passionate about building software that solves problems. • We count on our Site Reliability Engineers (SREs) to empower our users with a rich feature set, high availability, and stellar performance level to pursue their missions - using DevSecOps methodologies. • Our dynamic and innovative team is growing and we are looking to add a highly motivated and experienced Site Reliability Engineer to the team. • As an experienced DevSecOps practitioner, we look to you to operate and deliver working systems based on insights gathered from massive scale data in real time, ensuring Oomnitza’s internal and external services are reliable while keeping an ever-watchful eye on our systems, capacity, and performance. • You’ll have the opportunity to experience the complex challenges of building and running large-scale, fault tolerant, and secure distributed microservice based systems worldwide. • Specifically, we are searching for someone who brings fresh ideas to the table, and demonstrates a unique and informed viewpoint. • Enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

Requirements

• Extensive experience with container orchestration and managing production clusters, focusing on deployment, scaling, and troubleshooting within Kubernetes environments. Proven ability to set up and manage Kubernetes clusters effectively for enterprise applications. Experience with Amazon EKS is a plus. • Proficiency in tools like Ansible, Helm, and Kustomize for automating infrastructure provisioning, configuration, and deployment. Skilled in managing Kubernetes manifests and application releases to streamline processes and ensure consistency across various deployment environments. • Experience with Prometheus, Grafana, or similar to proactively track system health, detect anomalies, and optimize performance across the platform. • Deep knowledge of the AWS ecosystem, including EC2, S3, IAM, VPC, and other essential services for building and managing scalable infrastructure. • Hands-on experience with Terraform to provision and manage cloud resources, ensuring version control, repeatability, and efficiency in infrastructure deployment. • Familiarity with message queuing systems like RabbitMQ and Kafka, as well as managed queuing services such as AmazonMQ. Skilled in setting up, managing, and optimizing message brokers for high-throughput, reliable communication between distributed systems. • Strong background in managing MySQL databases and leveraging Amazon RDS for high availability, performance tuning, and secure database management in cloud environments. • Understanding of network design and security protocols to protect systems, enforce compliance, and meet industry-standard audit requirements. • Experience ensuring high uptime agreements for critical systems, implementing strategies for fault tolerance, disaster recovery, and proactive monitoring to maintain service availability and minimize downtime. • Proven ability to work effectively with cross-functional teams from multiple departments to achieve project goals and execute project plans in an orderly and efficient manner. • Ability to develop and maintain code in one or more high-level programming languages such as Python, Go, or JavaScript. Familiarity with modern development tools and CI/CD pipelines to automate testing, deployment, and monitoring. • A proactive mindset towards identifying system issues, areas for process improvement, and resolving performance bottlenecks.

Benefits

• Dental & Vision Insurance • Employee equity plan • Health Insurance for your spouse and dependents • Pension, Life insurance and Income protection • Remote working & flexible work schedules Working from home equipment allowance • Choice of preferred equipment, Mac or PC. • Regular, fun social events and workshops.

Apply Now

Similar Jobs

Yesterday

CaptivateIQ

201 - 500

Develop infrastructure and reliability solutions for CaptivateIQ's agile commission management platform.

🇮🇪 Ireland – Remote

💵 €96k - €120k / year

💰 $100M Series C on 2022-01

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability (SRE)

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com

Join our Facebook group

👉 Remote Jobs Network