December 30, 2024
• Assist in the operational support of multiple global multi-tenant cloud-based applications • The role requires you to keep vitally important IT systems up and running by overseeing day-to-day monitoring and response to alarms, fault and performance management activities. • Proactively monitor systems, networks, and applications to provide input in improving the stability, security, efficiency, and scalability of systems • Collaborates with stakeholders to ensure the success of cloud infrastructure operations, implementations, and infrastructure automation strategies throughout the organization • Helps establish and improve engineering best practices, concepts, and patterns with peers and the business. • Closely collaborate with software development leads to understand workload requirements and guide them to the best leverage of cloud services, optimizing for performance, security, and architectural flexibility. • Leads analysis of end-to-end system failures to identify opportunities across multiple systems. • Participate in incident reviews to create improved supportability documentation, diagnostics, tooling, error messages and automation • Manage the secure, scalable and resilient hosting of numerous applications in a regulated (HIPAA) environment that improves the lives of thousands of people on a daily basis. • Implement monitoring and security controls across various platforms. • Collaborate closely with the multiple technology and cross-functional groups within the organization. • Proven experience to analyze and run audit forensics, trend analysis and cloud data reporting • Manage ticket assignments, documentation and escalations for thorough and timely outcomes.
• Extensive hands-on experience with key AWS services, particularly Amazon Elastic Compute Cloud (EC2), API Gateway, Elastic Load Balancers (ELB), Simple Storage Service (S3), Elastic Block Store (EBS), and Amazon Elastic Compute Service (ECS). • Knowledge of secure cloud architecture and best practices such as AWS’s Well-Architected Framework • Experience with Application and Infrastructure Monitoring such as AWS Cloudwatch, Datadog, and PagerDuty • Practical knowledge of cloud automation and infrastructure-as-code (IaC) tools, such as AWS CloudFormation, Terraform, or Ansible. • In-depth architectural understanding of AWS and familiar with cloud native design patterns and DevOps principles (Infrastructure as Code) • Experience with Change management, incident review and root cause analysis of maintenances and network outages • Excellent communications skills and extensive experience working with technical teams and management • Knowledge and proficient understanding of designing scalable IT system infrastructures, implementing system changes, or automating service delivery and maintenance tasks.
Apply NowDecember 23, 2024
Looking for a Senior Cloud Infrastructure Engineer to enhance Tala’s cloud services and applications. Join us in empowering the Global Majority through innovative financial solutions.
December 20, 2024
Join TrueML, enhancing customer experience through software for distressed borrowers and improving infrastructure.