• Design, implement, and maintain infrastructure, with a strong focus on services such as Puppet and monitoring systems such as Zabbix, Prometheus, and Grafana, ensuring high availability, disaster recovery, and optimal performance
• Develop and support CI/CD automations and microservices, improving their interaction and efficiency within the department's area of responsibility
• Design robust production systems and related toolsets, ensuring maximum uptime, and troubleshoot as needed
• Improve service monitoring, implement proactive incident detection, and streamline incident response
• Operate and continuously develop internal services, focusing on automation and seamless interaction between services and components
• Regularly perform data backups and restorations, and conduct disaster recovery exercises
• Document infrastructure configurations, processes, and procedures, and prepare operational guidelines
• Deep knowledge of Linux and the networking stack, with strong experience in systems engineering, particularly with Puppet and monitoring systems such as Zabbix, Prometheus, and Grafana
• Strong scripting and programming skills in languages such as Python and Bash for automation
• Experience in supporting database services
• Knowledge of best practices in systems engineering and operations, including monitoring, storage, backup, security, and high availability
• Proven ability to manage and maintain configuration management systems, ensuring efficient service delivery and infrastructure management
• Experience in the development and support of CI/CD automations and microservices (desired)
• Familiarity with VMware virtualization and storage systems (desired)
• Solid troubleshooting skills in complex, distributed, and virtualized environments (desired)
• Understanding of Site Reliability Engineering (SRE) practices (desired)
• Quick learner, able to grasp complex systems rapidly and eager to stay current with the latest technological advancements (desired)
• Excellent communication skills, with the ability to work effectively across teams and with various stakeholders, serving as a point of escalation for complex issues (desired)
• Competitive salary
• Flexible schedule
• Remote, hybrid, or office work
• Educational support
• Medical insurance (depending on the contract type and your location)
• Business trips (depending on your role)