Lead Site Reliability Engineer

November 27

Apply Now
Logo of FactSet

FactSet

fintech β€’ portfolio management β€’ investment management β€’ investment banking β€’ wealth management

Description

β€’ Collaborate with development, operations, and product teams to define, review, and implement reliability standards and best practices. β€’ Design, implement, and maintain highly available and scalable architectures for our applications and infrastructure. β€’ Develop and enhance automated tools and frameworks to optimize system monitoring, deployment, and recovery. β€’ Troubleshoot and resolve complex issues throughout the entire software stack, including networking, databases, and distributed systems. β€’ Conduct performance analysis and capacity planning to ensure system scalability and resource optimization. β€’ Take a proactive approach to continuously improving reliability. β€’ Participate in incident response, root cause analysis, and postmortem activities to identify and rectify system failures. β€’ Collaborate with cross-functional teams to implement and improve CI/CD pipelines, ensuring reliable and efficient software releases. β€’ Stay up-to-date with emerging technologies and industry trends, actively contributing to ongoing system improvements. β€’ Participate in on-call rotation.

Requirements

β€’ Bachelor's degree in Computer Science, Engineering, or equivalent practical experience. β€’ Proven experience deploying and managing large-scale distributed systems successfully. β€’ Understanding of SRE concepts (error budgets, SLIs/SLOs, blameless postmortems) β€’ Proficiency in programming languages such as Python, C++, or Go β€’ Familiarity with monitoring and observability tools. β€’ Excellent problem-solving skills and ability to troubleshoot complex issues efficiently. β€’ Strong organizational and communication skills, with the ability to collaborate effectively in a cross-functional team environment. β€’ Familiarity with security best practices and experience implementing security measures in a production environment. β€’ Experience with modern infrastructure technologies and tools, including cloud platforms (AWS, Azure, GCP), containers (Docker, Kubernetes), and orchestration (Ansible, Chef, Puppet). β€’ Solid understanding of networking protocols and technologies (TCP/IP, DNS, load balancing). β€’ Demonstrated experience with infrastructure as code (IaC) and automation tools (e.g., Terraform, GitHub Actions).

Apply Now

Similar Jobs

November 27

As a Senior Site Reliability Engineer, support Employment Hero’s payroll platform. Work on significant improvements while ensuring system reliability and security.

November 27

Seeking a Senior AWS DevOps Engineer to enhance regulatory reporting quality in financial services.

Built byΒ Lior Neu-ner. I'd love to hear your feedback β€” Get in touch via DM or lior@remoterocketship.com