Senior Site Reliability Engineer

August 29


Stord

Storage • 3PL Management • Warehousing • Distribution • Third Party Logistics

501 - 1000

Description

• The SRE team is committed to accelerating development, enabling continuous delivery, enhancing security and ensuring operational excellence.
• Collaborate with cross-functional teams to design and implement CI/CD pipelines that automate fast and safe delivery of software to customers.
• Lead efforts in automating deployment, monitoring, and infrastructure management.
• Proactively identify and resolve performance bottlenecks, system failures, and security vulnerabilities.
• Implement best practice infrastructure as code (IaC) principles for configuration management and deployment of infrastructure.
• Enhance operational efficiency by identifying repetitive tasks and developing automation to eliminate toil work.
• Implement robust metrics, monitoring and alerting for proactive issue identification and resolution.
• Participate in incident response, on-call rotation and post-incident reviews.
• Implement and enforce security best practices for infrastructure and applications.
• Share knowledge through documentation, training, and mentorship.

Requirements

• Proven experience as a Senior DevOps Engineer or Senior Site Reliability Engineer.
• Strong expertise in cloud platforms such as AWS, GCP or Azure.
• Strong experience with CI/CD tools (GitHub Actions, GitLab CI, CircleCI) and version control systems (Git).
• Proficiency with infrastructure-as-code tools (e.g., Terraform, Ansible, CloudFormation).
• Hands-on experience with containers and container orchestration (Docker, Kubernetes).
• Solid understanding of networking, security, and system engineering.
• Experience with monitoring and logging tools (e.g., Datadog, Prometheus, Grafana, ELK stack).
• Strong scripting skills in languages such as Python, Shell or similar.
• Familiarity with security best practices and compliance requirements.
• Excellent problem-solving and troubleshooting skills.
• Ability to work collaboratively in a fast-paced, agile environment.
• Passion for building the highest-quality, long-term solutions that delight both internal and external customers.
• Automation-first mindset.
• High degree of ownership and pride in your work.

Benefits

• 401(k)
• Medical, Dental, and Vision Insurance
• Life and Disability Insurance
• Health Savings Account (HSA) option
• Employee Assistance Program (EAP) - Mental Health Resources
• Paid Parental Leave
• Gym Stipend
• Paid Time Off
• Paid holidays
• And more!


