Chaos Engineering Architect

Yesterday

Apply Now

Description

• Seeking a Chaos Engineering Architect to enhance cloud system resilience. • Responsible for chaos engineering practices and creating chaos drills. • Collaborate with cross-functional teams for system performance improvements. • Document methodologies and conduct training for team members. • Analyze chaos experiment results for system design improvements and incident management.

Requirements

• Bachelor’s degree in Computer Science, Engineering, or a related field; advanced degree preferred. • 5+ years of experience in software engineering, systems architecture, or related fields. • Proven experience with chaos engineering principles and practices in cloud environments. • Familiarity with chaos engineering tools (e.g., Gremlin, Chaos Monkey, Litmus) and observability platforms. • Strong knowledge of cloud computing architectures (AWS, Azure, GCP). • Proficiency in programming/scripting languages (Python, Go, Java, etc.) for automation of chaos experiments. • Experience with observability tools (e.g., Prometheus, Grafana, Datadog) to derive insights from chaos tests. • Excellent problem-solving skills and ability to think critically under pressure. • Strong communication skills to effectively share insights and findings with technical and non-technical stakeholders. • Ability to work collaboratively in a fast-paced, agile environment. • Experience with site reliability engineering (SRE) practices preferred. • Familiarity with microservices architectures and container orchestration (e.g., Kubernetes) preferred. • Understanding of incident response and disaster recovery planning preferred.

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com