5 days ago
• We are seeking a highly skilled and experienced Lead, Software Engineer - SRE to join our team. • In this role, you will play a crucial role in engineering highly resilient systems at scale. • Combining advanced Software Engineering practices with mature Operations skills, you will be responsible for delivering and operating reliable cloud services. • As a Lead, Software Engineer - SRE, you will ensure that our Cloud services meet the reliability and uptime requirements of our enterprise customers. • Design and implement highly scalable and resilient cloud operations to minimize customer downtime. • Conduct root cause analysis on outages and drive the implementation of corrective actions. • Collaborate with product teams to enhance resiliency across the product lifecycle. • Monitor customer infrastructure, proactively measuring availability and system health. • Collaborate with customer support to resolve escalated outages and incidents. • Troubleshoot complex incidents in highly distributed systems, identifying root causes and implementing effective solutions. • Improve the accuracy of alarms and shorten the time to detection to minimize impact. • Contribute as a key stakeholder in the design and architecture of resilient cloud services.
• Bachelor's, Master's degree, or PhD in Computer Science or a related field. • 6+ years of experience in software development or operations. • Proficiency in at least one high-level programming language (C++, Python, Java, C#, etc.). • Strong troubleshooting and debugging skills. • Availability to work in shifts and participate in the on-call rotation. • Fluency in English and excellent communication skills. • Experience with automation and Infrastructure as Code (IaC) tools such as CloudFormation, Terraform, Chef, etc. (Preferred). • Familiarity with AWS services like EC2, RDS, Lambda, and step-functions. (Preferred). • Knowledge of containerization technologies like Docker and orchestration platforms like Kubernetes. (Preferred). • Proficiency in monitoring and troubleshooting complex distributed systems. (Preferred). • Strong understanding of designing resilient and fault-tolerant systems. (Preferred). • Expertise in debugging complex distributed systems. (Preferred).
• A company that continues to grow, change, and innovate, and gives our teams the space to be proactive and creative. • Real career opportunities. • We care about growth and development. • Work colleagues who are as smart, hardworking, and driven as you – and a team that is global. • We offer the possibility for lateral moves, joining different teams, and mastering specific skills. • Disrupting the status quo is in our DNA. • Join us in disrupting the status quo of the low-code market, we give you the power to "Ask Why", you give our customers the power to innovate through software!
Apply NowJanuary 23
501 - 1000
Join Bitsight to support the Core Data team in developing data solutions. Drive change while enhancing their cyber risk management services.
January 15
Join QuintoAndar as a Technical Lead Manager, guiding the Search & Recommendations team to innovate real estate technology solutions.
January 11
Join Sword Health to innovate pain treatment technology as a Senior Algorithms Software Engineer remotely from Portugal.
January 10
Oversee software development at Sword Health, transforming pain treatment with technology and innovation.
January 3
Seeking a Senior .NET developer and Tech Lead to evolve LineTen’s software solutions for urban delivery.