Stord

Website LinkedIn All Job Openings

Storage • 3PL Management • Warehousing • Distribution • Third Party Logistics

501 - 1000 employees

Founded 2019

🛍️ eCommerce

Senior Site Reliability Engineer

August 29

🇺🇸 United States – Remote

🍑 Georgia – Remote

⏰ Full Time

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🦅 H1B Visa Sponsor

Ansible

AWS

Azure

Cloud

Docker

Google Cloud Platform

Grafana

Kubernetes

Prometheus

Python

Terraform

Apply Now

Stord

Website LinkedIn All Job Openings

Storage • 3PL Management • Warehousing • Distribution • Third Party Logistics

501 - 1000 employees

Founded 2019

🛍️ eCommerce

Description

• The SRE team is committed to accelerating development, enabling continuous delivery, enhancing security and ensuring operational excellence. • Collaborate with cross-functional teams to design and implement CI/CD pipelines that automate fast and safe delivery of software to customers. • Lead efforts in automating deployment, monitoring, and infrastructure management. • Proactively identify and resolve performance bottlenecks, system failures, and security vulnerabilities. • Implement best practice infrastructure as code (IaC) principles for configuration management and deployment of infrastructure. • Enhance operational efficiency by identifying repetitive tasks and developing automation to eliminate toil work. • Implement robust metrics, monitoring and alerting for proactive issue identification and resolution. • Participate in incident response, on-call rotation and post-incident reviews. • Implement and enforce security best practices for infrastructure and applications. • Share knowledge through documentation, training, and mentorship.

Requirements

• Proven experience as a Senior DevOps Engineer or Senior Site Reliability Engineer. • Strong expertise in cloud platforms such as AWS, GCP or Azure. • Strong experience with CI/CD tools (Github Actions, GitLab CI, CircleCI) and version control systems (Git). • Proficiency with infrastructure-as-code tools (e.g., Terraform, Ansible, Cloudformation). • Hands-on experience with container orchestration tools like Docker and Kubernetes. • Solid understanding of networking, security, and system engineering. • Experience with monitoring and logging tools (e.g., Datadog, Prometheus, Grafana, ELK stack). • Strong scripting skills in languages such as Python, Shell or similar. • Familiarity with security best practices and compliance requirements. • Excellent problem-solving and troubleshooting skills. • Ability to work collaboratively in a fast-paced, agile environment. • Passion for building the highest-quality solutions for the long term that delight the customer (both internal and external customers). • Automation first mindset. • High degree of ownership and pride for work.

Benefits

• 401(k) • Medical, Dental, and Vision Insurance • Life and Disability Insurance • Health Savings Account (HSA) option • Employee Assistance Program (EAP) - Mental Health Resources • Paid Parental Leave • Gym Stipend • Paid Time Off • Paid holidays • And more!

Apply Now