Senior HA-Systems Engineer

13 hours ago

Apply Now
Logo of Datasite

Datasite

M&A β€’ Virtual Data Rooms β€’ Due Diligence β€’ Corporate Development β€’ AI

Description

β€’ Design and Architecture: Architect and build highly available, fault-tolerant systems to support mission-critical applications. β€’ Collaborate with cross-functional teams to design scalable, robust, and secure cloud-based solutions. β€’ Develop strategies for disaster recovery, data replication, and failover processes. β€’ System Performance and Optimization: Analyze system performance, identify bottlenecks, and implement optimizations to ensure optimal uptime and performance. β€’ Conduct load testing, capacity planning, and performance tuning to meet high availability requirements. β€’ Utilize monitoring tools to proactively detect issues and minimize downtime. β€’ Automation and Infrastructure as Code: Develop and maintain infrastructure as code (IaC) using tools like Terraform and Ansible. β€’ Implement automation for deployments, scaling, and configuration management to reduce manual intervention and increase system reliability. β€’ Incident Management and Troubleshooting: Lead incident response and root cause analysis for system outages, ensuring quick resolution and prevention of future incidents. β€’ Build and maintain robust monitoring, alerting, and diagnostic systems for proactive issue identification. β€’ Mentorship and Leadership Provide technical leadership, mentorship, and guidance to junior engineers and other team members. β€’ Stay updated on the latest trends in high availability and distributed systems, and share knowledge within the team.

Requirements

β€’ Bachelor's or Master's degree in Computer Science, Engineering, or a related field (or equivalent experience). β€’ 8+ years of experience in systems engineering, infrastructure architecture, or related fields. β€’ Proven track record of designing and implementing highly available, fault-tolerant systems in cloud or on-prem environments. β€’ Experience with distributed systems, microservices architecture, and high availability patterns (e.g., active-active, active-passive). β€’ Proficient in cloud platforms (Azure, GCP, AWS) or on-prem data centers and cloud-native technologies. β€’ Deep knowledge and understanding of Linux systems β€’ Experience using monitoring and observability tools (Prometheus, Grafana, Loki, etc.). β€’ Strong coding/scripting skills in Python, Go, or Shell for automation. β€’ Excellent problem-solving skills with a focus on resilience and scalability. β€’ Strong communication skills with the ability to convey complex technical concepts to diverse stakeholders. β€’ Ability to work independently and take ownership of projects from inception to deployment.

Benefits

β€’ Competitive salary and performance-based bonuses. β€’ Comprehensive benefits package (health, dental, vision, 401k match). β€’ Opportunities for professional growth and career advancement. β€’ Flexible work environment, including remote options. β€’ A dynamic, collaborative team environment where your ideas matter.

Apply Now

Similar Jobs

13 hours ago

As a lead systems engineer, oversee project planning and implementation to enhance IT solutions for Centene's 28 million members, improving health outcomes worldwide.

3 days ago

Analyze and maintain systems for WVU Health System, optimizing workflows and ensuring safety. Join a team dedicated to improving healthcare through effective technology.

3 days ago

Join NGL as a Senior Systems Analyst, delivering quality systems analysis and support. Provide business solutions while mentoring BSA's in a collaborative, remote environment.

Built byΒ Lior Neu-ner. I'd love to hear your feedback β€” Get in touch via DM or lior@remoterocketship.com