SRE - Site Reliability Engineer

September 4

Apply Now
Logo of Gigster

Gigster

App development • Web development • Rapid Prototyping • Product Management • software development

11 - 50

Description

• Work cross-functionally with customers and partners to address complex customer questions and drive incidents to resolution. • Troubleshoot, isolate, and resolve container orchestration/management issues, specifically Docker and Kubernetes. • Troubleshoot, isolate, and resolve complex application deployment issues. • Analyze and categorize customer interaction trends to provide accurate and meaningful feedback to Engineering and SRE organizations. • Build knowledge base articles. • Participate in post-mortem reviews, and drive ongoing improvement. • Obsess over minute details to drive operational excellence in every aspect of your work. • Make meaningful and significant contributions to our client’s development of objectives and goals. • Work on complex issues, analyzing situations or data that require in-depth evaluation. • Your expertise and discretion will be required to think outside the box and resolve highly complex issues creatively and effectively. • Work hours are from 8 AM to 5 PM United Kingdom time, Monday to Friday. While there is no on-call, if there’s any maintenance during the weekend you will need to be available, and then you'll get time off to compensate for that.

Requirements

• Robust level of spoken and written English. • Several years of experience (ideally 5 or more) in supporting highly scalable applications and web services. • Experience working with Cloud technologies (AWS mandatory, Google Cloud - nice to have). • Hands-on experience with open-source technologies such as Kubernetes and Docker (mandatory), Spark, and Kafka (at least basic knowledge of these two). • Extensive experience in deploying, managing, and scaling applications using Kubernetes. Familiarity with Helm, Kustomize, or similar tools. • Deep understanding and hands-on experience with Linux administration, scripting (Bash, Python or any similar programming/scripting language), and troubleshooting. • Understanding of network protocols, DNS, load balancing, and related technologies. • Comfortable working with a very technical customer base. • Comfortable providing a wide range of support, from simple issue resolution to end-to-end onboarding. • Strong familiarity with modern server technologies and architectures. • Strong problem-solving skills, excellent communication skills, and the ability to work collaboratively in a team environment. • Strong attention to detail. • Excellent analytical capabilities. • Passionate about Technical Support. • Strong troubleshooting experience. • Ability to work as an independent contractor for a US company. • Nice to Have: Observability and logging tools experience like Prometheus, Grafana, Datadog, Splunk, etc. • Nice to Have: Hands-on experience with Relational Database Management Systems. • Nice to Have: Relevant certifications such as CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer), or Linux Professional Institute Certification (LPIC). • Nice to Have: Familiarity with tools like Terraform, Ansible, or Puppet.

Benefits

• World-class network. Be part of a network with the most talented people in the world. • Amazing cutting-edge projects. Pick the projects from F500 companies that you’re interested in. • 100% remote and global. Live your best life, wherever that may be, and never lose out on career opportunities because of it. • Flexible work hours. There is a time to overlap with the customer’s timezone, but most of the time, we work asynchronously and don’t care when you’re online; you just deliver great results. • Flexible offerings. Choose how many hours you want to work and how much you want to earn. • Swag! Because who doesn’t love swag?

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com