Site Reliability Engineer

October 15

🇹🇼 Taiwan – Remote

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

Apply Now

Description

Aethir is an Enterprise-grade AI-focused GPU-as-a-service provider. Responsible for monitoring, troubleshooting, and optimizing production systems for AI and gaming customers worldwide.

Requirements

Bachelor's degree in Computer Science, Engineering, or a related field. Experience in operations and maintenance development, preferably in a cloud computing or AI-focused environment. Strong understanding of system architecture, performance monitoring, and troubleshooting methodologies. Excellent communication and collaboration skills. Ability to work in a fast-paced, startup environment. Proficiency in Kubernetes (K8S), CI/CD, and Docker. Expertise in AWS (VPC, S3, EC2, etc.) or Python (one of the two). Responsible for building the operations and maintenance infrastructure platform and handling core business operations. Management experience is a plus, but not required. Prior experience working in structured environments such as Huawei, ZTE, or banking institutions is preferred.

Benefits

Hypergrowth Startup Environment Fantastic Career Progression Opportunities Work within a Global and Local Team Collaborative and innovative work environment with opportunities to contribute to cutting-edge projects.

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com