Site Reliability Engineer

October 17

Apply Now
Logo of Chatbooks

Chatbooks

Printing MemoriesA • Photo Books • Prints • Photography • Consumer Products

51 - 200

Description

• Site Reliability Engineers (SREs) at Chatbooks design, implement, and maintain distributed, fault-tolerant systems. • They learn quickly and apply sound engineering principles to existing and new problems. • Much of the work focuses on improving and optimizing existing systems, building infrastructure, and eliminating work through automation. • Implement and improve observability, incident response, and monitoring practices that lead to greater operational excellence. • Take ownership of uptime, reliability, and performance of core systems. • Collaborate across departments responding to and resolving issues. • Leverage a combination of commercial software and services, open-source products, and cloud offerings to manage applications, infrastructure, and services with a customer-focused mindset.

Requirements

• Clear written and verbal communication skills. • Experience working in one of the large public cloud environments (AWS, Azure, or GCP). • Knowledge of distributed systems, including load balancing and data replication. • Proficiency with logging and monitoring tools, such as Prometheus, Grafana and Elastic APM. • Familiarity with containerization technologies, such as Docker and Kubernetes. • Background in managing web applications, backend services, and system architecture. • Using infrastructure automation tools for deployments like Terraform or CloudFormation. • Possess a growth mindset, including advancing your skills—a fast learner of new concepts and technologies. • Strong understanding of continuous integration and continuous deployment (CI/CD) pipelines. • Prior hands-on software development experience.

Benefits

• 100% coverage for medical for you and your family • 100% coverage for employee dental, and vision, group life and long-term disability insurance —yes, you read that right we cover it all • Unlimited, free therapy sessions from the comfort of your home • 4-12 weeks paid Parental Leave • Paid Holidays • A good work/life balance is encouraged! We’ve implemented Mandatory Time Off with a recommendation to take 1 week off every season • Company-wide winter break to relax with your family • 401k plan with employer match of up to 4% • We are a remote first company headquartered in Lehi, Utah • Company sponsored WFH Setup • Ability to travel 2-3 times a year to our Lehi, Utah Clubhouse for in-person collaboration and annual Chatfest celebration

Apply Now

Similar Jobs

October 15

Lucidworks

201 - 500

DevOps Engineer for Lucidworks’ cloud platform, ensuring customer success through automation.

🇺🇸 United States – Remote

💵 $140k - $155k / year

💰 $100M Series F on 2019-08

⏰ Full Time

🟡 Mid-level

🟠 Senior

⛑ DevOps & Site Reliability Engineer (SRE)

🗽 H1B Visa Sponsor

October 12

Support implementation of SRE practices for enhanced system reliability.

October 11

FWDthink

11 - 50

FWDthink seeks a DevOps/Cloud Security Engineer for DHS cloud security projects.

Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com