Platform Reliability Analyst

October 6

Apply Now
Logo of Bounteous

Bounteous

501 - 1000

💰 Private Equity Round on 2021-08

Description

• The Platform Reliability Analyst is responsible for ensuring the continuous monitoring and overall health of our cloud infrastructure hosted on platforms such as AWS, Rackspace, Expedient, and Heroku. • This role involves proactive monitoring of system performance, coordinating incident response efforts, and collaborating with development, cloud, and operations teams to address issues before they impact the business. • The ideal candidate will have a process-oriented mindset, strong communication skills, and a foundational understanding of cloud technologies to facilitate rapid resolution of incidents and optimize system performance.

Requirements

• Strong Communication Skills: Clear, concise English to convey the status of incidents and performance issues to both technical and non-technical stakeholders. • Process-Oriented Mindset: Ability to follow, document, and improve processes to ensure smooth incident management and resolution. • Attention to Detail: Capability to record key details about system health, performance, and incident facts versus theories in real-time. • Familiarity with Monitoring Tools: Experience using monitoring and alerting tools such as New Relic, Cloudwatch, or Datadog, and familiarity with logs, traffic monitoring, and system health metrics. • Coordination and Leadership Skills: Ability to lead incident response teams, coordinate with various technical experts, and manage communication effectively during outages. • Basic Technical Understanding: While not an engineering role, some technical familiarity with cloud environments, system alerts, and security practices is important. Entry-level engineers with an interest in coordination roles are encouraged to apply. • Collaboration: Ability to work cross-functionally with development, cloud, and support teams to ensure smooth operations and proactive issue resolution.

Apply Now

Similar Jobs

October 3

Fortrea

10,000+

Workday Prism Analyst delivering analytics to optimize HR and Finance decisions at Fortrea.

October 3

NCR Atleos

10,000+

Manage NCR services delivery to improve customer satisfaction and service fidelity.

October 3

Trellix

1001 - 5000

Create and maintain Workday reports and dashboards for Trellix's HR operations.

October 1

Clario

1001 - 5000

Solutions Design Analyst for Clario's eCOA platform delivery team.

September 24

CSG

5001 - 10000

Customer support role for product training and tech assistance at CSG.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com