Sr Site Reliability Engineer

August 29

Apply Now
Logo of Stord

Stord

Cloud Supply Chain | Fulfillment, Transportation & Technology

Storage • 3PL Management • Warehousing • Distribution • Third Party Logistics

501 - 1000

Description

• The SRE team is committed to accelerating development, enabling continuous delivery, enhancing security and ensuring operational excellence. • Collaborate with cross-functional teams to design and implement CI/CD pipelines that automate fast and safe delivery of software to customers. • Lead efforts in automating deployment, monitoring, and infrastructure management. • Proactively identify and resolve performance bottlenecks, system failures, and security vulnerabilities. • Implement best practice infrastructure as code (IaC) principles for configuration management and deployment of infrastructure. • Enhance operational efficiency by identifying repetitive tasks and developing automation to eliminate toil work. • Implement robust metrics, monitoring and alerting for proactive issue identification and resolution. • Participate in incident response, on-call rotation and post-incident reviews. • Implement and enforce security best practices for infrastructure and applications. • Share knowledge through documentation, training, and mentorship.

Requirements

• Proven experience as a Senior DevOps Engineer or Senior Site Reliability Engineer. • Strong expertise in cloud platforms such as AWS, GCP or Azure. • Strong experience with CI/CD tools (Github Actions, GitLab CI, CircleCI) and version control systems (Git). • Proficiency with infrastructure-as-code tools (e.g., Terraform, Ansible, Cloudformation). • Hands-on experience with container orchestration tools like Docker and Kubernetes. • Solid understanding of networking, security, and system engineering. • Experience with monitoring and logging tools (e.g., Datadog, Prometheus, Grafana, ELK stack). • Strong scripting skills in languages such as Python, Shell or similar. • Familiarity with security best practices and compliance requirements. • Excellent problem-solving and troubleshooting skills. • Ability to work collaboratively in a fast-paced, agile environment. • Passion for building the highest-quality solutions for the long term that delight the customer (both internal and external customers). • Automation first mindset. • High degree of ownership and pride for work.

Benefits

• 401(k) • Medical, Dental, and Vision Insurance • Life and Disability Insurance • Health Savings Account (HSA) option • Employee Assistance Program (EAP) - Mental Health Resources • Paid Parental Leave • Gym Stipend • Paid Time Off • Paid holidays • And more!

Apply Now

Similar Jobs

August 29

Opportunity to contribute to leading college sports recruiting network.

August 28

Toast

1001 - 5000

Enable engineering teams to ensure smooth operation of Toast’s restaurant platform.

🇺🇸 United States – Remote

💵 $131k - $210k / year

⏰ Full Time

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

August 28

Ensure health, availability, and performance of database systems for Wikimedia Foundation.

🇺🇸 United States – Remote

💵 $109k - $169k / year

💰 $2.5M Grant on 2019-09

⏰ Full Time

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

August 28

Ensure database system health for efficient knowledge sharing at Wikimedia.

🇺🇸 United States – Remote

💵 $109k - $169k / year

💰 $2.5M Grant on 2019-09

⏰ Full Time

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

August 23

Gemini

501 - 1000

Lead Site Reliability Engineers to ensure reliability and scalability of Gemini's infrastructure.

🇺🇸 United States – Remote

💵 $172k - $215k / year

💰 Venture Round on 2022-02

⏰ Full Time

🟡 Mid-level

🟠 Senior

👨🏻‍🔧 Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com