Senior System Operations Engineer

3 hours ago

GR8 Tech

BaaS • Platform • Franchise • Sports Betting • iGaming

501 - 1000

Description

• Designing and maintaining high-load, highly available datacenter infrastructure
• Conducting capacity planning to ensure scalability for current and future demands
• Drawing up and maintaining up-to-date department documentation
• Collecting requirements and leading the development and execution of Proofs of Concept (PoCs)
• Designing and conducting performance tests to validate that the infrastructure handles expected loads
• Designing, implementing, and managing infrastructure as code using tools like Terraform
• Designing, implementing, and managing CI/CD pipelines to automate infrastructure deployment
• Documenting incidents, resolutions, and lessons learned to improve response strategies
• Planning, executing, and managing datacenter infrastructure improvement projects
• Leading and mentoring junior and mid-level engineers, fostering their professional growth
• Participating in cross-departmental meetings to align datacenter operations with organizational goals

Requirements

• Strong understanding of networking concepts, protocols, and services (TCP/IP, DNS, DHCP, HTTP/HTTPS, VLANs, BGP)
• Experience implementing traffic load distribution systems (HAProxy, Nginx)
• Proficiency in configuring and troubleshooting host virtualized networks
• Expert-level management of Windows Server and Linux (Red Hat, Debian-based) systems
• In-depth knowledge of system architecture, configuration, and optimization
• Advanced expertise in VMware and KVM technologies, including performance tuning
• Proficiency in vSphere components (ESXi, vCenter, vSAN)
• In-depth knowledge of server hardware (CPUs, RAM, storage, power supplies, network interfaces)
• Strong troubleshooting skills for hardware failures and performance issues
• Advanced knowledge of Kubernetes architecture, deployment, and management
• Proficiency with tools like Zabbix, Grafana, Prometheus, and ELK Stack
• Experience with VMware monitoring solutions (vRealize Operations Manager, Log Insight)
• Expertise in automation tools (Ansible, AWX) and scripting (Python, Bash, PowerShell)
• Proficiency with Infrastructure as Code (IaC) tools like Terraform and Terragrunt
• Experience creating and maintaining CI/CD pipelines
• Skill with databases like MySQL, MariaDB, PostgreSQL, and MongoDB, including replication and clustering
• Proficiency in managing storage solutions (NFS, vSAN, Ceph, object storage)
• Deep understanding of private and hybrid cloud platforms
• Familiarity with compliance standards
• Expertise in designing and managing robust backup solutions
• Ability to architect scalable, resilient, high-load infrastructure solutions
• Proven experience in handling high-severity incidents and implementing long-term fixes

Benefits

• Sports compensation
• Medical coverage
• Psychological support
• Home-office coverage
• Remote work, coworking compensation
• Childcare budget
• Maternity leave
• Paternity leave
• Additional 2 days for family events