Post a Job Affiliates

Search Remote Jobs

FactSet

Website LinkedIn All Job Openings

fintech • portfolio management • investment management • investment banking • wealth management

10,000+ employees

Founded 1982

💸 Finance

💳 Fintech

☁️ SaaS

Lead Site Reliability Engineer

November 27

🇬🇧 United Kingdom – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🇬🇧 UK Skilled Worker Visa Sponsor

Ansible

AWS

Azure

Chef

Cloud

Distributed Systems

DNS

Docker

Google Cloud Platform

Kubernetes

Puppet

Python

TCP/IP

Terraform

Apply Now

FactSet

Website LinkedIn All Job Openings

fintech • portfolio management • investment management • investment banking • wealth management

10,000+ employees

Founded 1982

💸 Finance

💳 Fintech

☁️ SaaS

Description

• Collaborate with development, operations, and product teams to define, review, and implement reliability standards and best practices. • Design, implement, and maintain highly available and scalable architectures for our applications and infrastructure. • Develop and enhance automated tools and frameworks to optimize system monitoring, deployment, and recovery. • Troubleshoot and resolve complex issues throughout the entire software stack, including networking, databases, and distributed systems. • Conduct performance analysis and capacity planning to ensure system scalability and resource optimization. • Take a proactive approach to continuously improving reliability. • Participate in incident response, root cause analysis, and postmortem activities to identify and rectify system failures. • Collaborate with cross-functional teams to implement and improve CI/CD pipelines, ensuring reliable and efficient software releases. • Stay up-to-date with emerging technologies and industry trends, actively contributing to ongoing system improvements. • Participate in on-call rotation.

Requirements

• Bachelor's degree in Computer Science, Engineering, or equivalent practical experience. • Proven experience deploying and managing large-scale distributed systems successfully. • Understanding of SRE concepts (error budgets, SLIs/SLOs, blameless postmortems) • Proficiency in programming languages such as Python, C++, or Go • Familiarity with monitoring and observability tools. • Excellent problem-solving skills and ability to troubleshoot complex issues efficiently. • Strong organizational and communication skills, with the ability to collaborate effectively in a cross-functional team environment. • Familiarity with security best practices and experience implementing security measures in a production environment. • Experience with modern infrastructure technologies and tools, including cloud platforms (AWS, Azure, GCP), containers (Docker, Kubernetes), and orchestration (Ansible, Chef, Puppet). • Solid understanding of networking protocols and technologies (TCP/IP, DNS, load balancing). • Demonstrated experience with infrastructure as code (IaC) and automation tools (e.g., Terraform, GitHub Actions).

Apply Now

Similar Jobs

Senior Site Reliability Engineer - .Net

November 27

Employment Hero

501 - 1000

☁️ SaaS

👥 HR Tech

🎯 Recruitment

Website LinkedIn All Job Openings

As a Senior Site Reliability Engineer, support Employment Hero’s payroll platform. Work on significant improvements while ensuring system reliability and security.

🇬🇧 United Kingdom – Remote

💰 Series F on 2022-02

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🇬🇧 UK Skilled Worker Visa Sponsor

AWS

Cloud

Terraform

.NET

Apply

View Job