October 1
β’ Assist in the operational support of multiple global multi-tenant cloud-based applications β’ The role requires you to keep vitally important IT systems up and running by overseeing day-to-day monitoring and response to alarms, fault and performance management activities. β’ Proactively monitor systems, networks, and applications to provide input in improving the stability, security, efficiency, and scalability of systems β’ Helps establish and improve engineering best practices, concepts, and patterns with peers and the business. β’ Closely collaborate with software development leads to understand workload requirements and guide them to the best leverage of cloud services, optimizing for performance, security, and architectural flexibility. β’ Leads analysis of end-to-end system failures to identify opportunities across multiple systems. β’ Participate in incident reviews to create improved supportability documentation, diagnostics, tooling, error messages and automation β’ Manage the secure, scalable and resilient hosting of numerous applications in a regulated (HIPAA) environment that improves the lives of thousands of people on a daily basis. β’ Implement monitoring and security controls across various platforms. β’ Collaborate closely with the multiple technology and cross-functional groups within the organization. β’ Proven experience to analyze and run audit forensics, trend analysis and cloud data reporting β’ Manage ticket assignments, documentation and escalations for thorough and timely outcomes.
β’ Proven work experience with AWS (e.g. ECS, Fargate, S3, Lambda, Cloudwatch, etc) β’ Experience with Cloud Security standards and frameworks including NIST, CIS , etc as well as cloud security services (e.g. AWS GuardDuty, Cloudtrail, Security Hub, Inspector, etc) β’ Knowledge of secure cloud architecture and best practices such as AWSβs Well-Architected Framework β’ Experience with Application and Infrastructure Monitoring such as AWS Cloudwatch, Datadog, and PagerDuty β’ In-depth architectural understanding of AWS and familiar with cloud native design patterns and DevOps principles (Infrastructure as Code) β’ Experience with Change management, incident review and root cause analysis of maintenances and network outages β’ Excellent communications skills and extensive experience working with technical teams and management β’ Knowledge and proficient understanding of designing scalable IT system infrastructures, implementing system changes, or automating service delivery and maintenance tasks.
Apply NowSeptember 22
51 - 200
Seeking cloud infrastructure engineers for ClickHouse Cloud managed services.
August 11
5001 - 10000
Help build and run large-scale, fault-tolerant systems for eLearning services.
June 27
51 - 200
August 16, 2023
51 - 200
Collabora seeks a CI & Testing Developer for Open Source projects.