Cloud Site Reliability Engineer - SRE

2 days ago

Apply Now
Logo of Agilite Group

Agilite Group

Talend Data Integration • Talend MDM • Talend ESB • Talend DQ • Business Intelligence

201 - 500

Description

• Play a key role in ensuring the reliability, scalability, and performance of cloud systems. • Monitor, optimize, and troubleshoot cloud infrastructure and services to minimize downtime. • Implement best practices for high availability and disaster recovery across cloud environments. • Develop and maintain automation scripts and Infrastructure as Code (IaC) templates. • Collaborate with development teams to design scalable and performant cloud architectures. • Participate in incident response activities, including root cause analysis and resolution. • Implement security best practices and compliance measures in cloud environments. • Monitor resource utilization and forecast capacity requirements for business growth. • Maintain comprehensive documentation of cloud configurations, processes, and procedures. • Share knowledge and best practices with team members and contribute to a culture of continuous learning.

Requirements

• Bachelor's Degree in Computer Science, Information Technology, or a related field. • 4+ years of experience in cloud operations, SRE, or a related role. • Proficiency in cloud platforms such as AWS, Azure, or Google Cloud. • AWS Certification is Mandatory – AWS Certified Solution Architect. • Certification in cloud platforms (e.g., AWS Certified Solution Architect, Google Cloud Professional DevOps Engineer, Azure DevOps Engineer Expert). • Experience with containerization and orchestration tools (e.g., Docker, Kubernetes). • Knowledge of infrastructure monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). • Strong scripting and programming skills (e.g., Python, Bash, Go). • Familiarity with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI/CD). • Excellent problem-solving and communication skills. • Ability to work collaboratively in a cross-functional and fast-paced environment.

Apply Now

Similar Jobs

3 days ago

Zingtree

11 - 50

Mid-Level DevOps Engineer at Zingtree automating processes and supporting customer experience operations.

🇺🇸 United States – Remote

💰 $15M Series A on 2022-01

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

3 days ago

Cloud Operations Engineer improving incident command and monitoring at Lumin Digital.

🇺🇸 United States – Remote

💵 $100k - $125k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

3 days ago

Cutover

51 - 200

Ensure the reliability of Cutover's enterprise platform using modern technologies.

🇺🇸 United States – Remote

💵 $105k - $165k / year

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com