First Responders • Cybersecurity • Big Data • Cyber Intelligence • Research and Development
51 - 200
September 17
First Responders • Cybersecurity • Big Data • Cyber Intelligence • Research and Development
51 - 200
• Collaborate with Customer Support and DevOps teams to establish SLA, SLO, and SLI • Maintain 24/7 production stability year-round • Deploy, configure, and monitor production environments • Automate production deployments, validations, and reporting processes • Develop and maintain tools for production operations • Manage and document incidents • Develop disaster recovery automation • Handle Mean Time to Respond (MTTR) and Mean Time to Detect (MTTD) metrics • Implement strategies to ensure 100% application uptime • Work with development and QA teams to enhance code quality and resilience
• At least 2 years of experience in a similar role (DevOps, SRE, System Engineer) • Experience with IaC practices (Terraform) • Experience with Docker and Kubernetes • Experience with one of the major cloud providers (AWS, Azure) • Worked with Linux Administrative Skills • Proven work experience with Python is mandatory • Excellent problem-solving and communication skills • Be willing to understand the business logic of each component and its impact
• Working from home • Flexible hours • Yearly performance bonus • Paid medical insurance • Daily lunch allowance • Sport/Gym(Exercise) allowance • Udemy unlimited subscription • Onboarding plan and training • Equipment support • No dress code • Gifts and rewards • Happy hours, coffee time, online team building, company events • Fresh fruit, snacks, coffee, and tea
Apply NowSeptember 16
10,000+
Develop and maintain healthcare systems for patient safety and confidentiality.
August 28
201 - 500
Enhance operational efficiency by improving processes at NASA Federal Credit Union.
August 8
51 - 200
Drive vision and systems balancing for an online multiplayer experience.