Senior Site Reliability Engineer

June 3

🇲🇽 Mexico – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

IO Connect Services

Software Engineering • System Integrations • Cloud Technologies • Cybersecurity • Big Data

51 - 200

Description

• Responsible for designing, building, maintaining, and scaling production services and server farms across multiple data centers for complex, data-intensive cloud services
• Design and enhance software architecture to improve scalability, service reliability, capacity, and performance
• Write automation code for provisioning and operating infrastructure at massive scale. You are not an operator; you’re an experienced software engineer focused on operations
• Work with development teams to make sure applications fit well within the infrastructure and that scalability and reliability are designed and implemented from the ground up. You will work with QA on building pipelines and automation for delivering and deploying applications to production
• Roll up your sleeves to troubleshoot incidents, formulate and test hypotheses, and narrow down possibilities to find the root cause
• Write postmortem reviews and remediation recommendations
• Identify bad trends before they become problems; respond to automated system alerts, troubleshoot system errors effectively, and work incidents to return systems to normal operating conditions
• Author and update high-quality documentation of all relevant specifications, systems, and procedures
• Support and comply with the company’s Quality Management System policies and procedures

Requirements

• Bachelor’s degree (or equivalent) in computer science or a related discipline
• Knowledge of IaC technologies such as Terraform, Ansible, Puppet, and Chef
• Knowledge of cluster creation and management with Kubernetes
• Knowledge of Microsoft Azure, AWS, and Google Cloud, including Azure services, virtual machines in Azure, and virtual network configuration
• Knowledge of cloud service models such as IaaS, PaaS, and SaaS
• Knowledge of CI/CD
• Scripting knowledge with PowerShell
• Knowledge of IP addressing and subnet masks
• Ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript
• Experience with distributed storage technologies such as NFS, HDFS, Ceph, and Amazon S3, as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, YARN)
• Proactive approach to identifying problems, performance bottlenecks, and areas for improvement

Benefits

• Base Salary and permanent contract directly with the company
• Continuous training plan with paid certifications
• Career plan according to your development and knowledge
• Benefits above the law: 12 days of Paid Time Off, 30-day Christmas Bonus, Medical Insurance, Life Insurance, Savings Fund, Groceries Bonus
• Quarterly Performance Bonus
• Computer equipment for your work
• Optional 100% Home Office
