Site Reliability Technical Lead

August 31

Apply Now
Logo of 3Pillar Global

3Pillar Global

Building digital businesses, together.

Product Strategy • Product Development • Product Architecture • Mobile Apps • Big Data

1001 - 5000

💰 Private Equity Round on 2021-10

Description

•Independently guide the technical direction and implementation by the whole team within defined architecture in all stages from conceptualization to deployment. •Evaluate trade-offs between correctness, robustness, performance, and customer impact to ensure the development of the right solution, with client success at the forefront. •Create and lead the team's technical documentation and repository management practices, including tasks such as creating branches, pull requests, merges, etc. •Collaborate with product, design, and engineering teams to provide necessary oversight of architecture and dependencies influencing product strategy and direction. •Contribute to code reviews, documentation, and addressing complex bug fixes with a focus on security, performance, and reliability. •Be an active leader in the Engineering Practice community, mentoring Senior Engineers and others through Communities of Practice (CoPs) or on project teams, supporting the growth of technical capabilities.

Requirements

•Will be participating in the early phase of an applications lifecycle from architecture and design with a focus on resilience, fault tolerance, failure scenarios, dependencies, observability and scalability. He will also be responsible for setup, configuration and management of SRE tools for visibility on the environment. •Bachelor's Degree in Information Technology, Computer Science or equivalent work experience •3+ years of experience in SRE engineering role for supporting highly available production systems in cloud environments •Experience with design and implementation of SRE functions implementing mature SRE best practices. •Experience with defining SRE standards and supporting implementation and adoption of these standards. •Experience with using and enablement of monitoring and alerting tools and services (Datadog preferred). •Understanding of cloud architectures, microservices and distributed systems •Adept in the development of automated tools, systems, and services in multiple technology domains •Advanced knowledge of one or more infrastructure components (e.g. networking, cloud srvices, orchestration tools, containerization, compute, and storage systems) •Proficiency in service-level changes to a system and troubleshooting components •Experience in managing SLA and incident response calls.

Benefits

•Employer funded medical plan for employees. •Employer funded dental plan for employees. •401K retirement savings plan •Company paid disability and life insurance and the option to purchase additional coverage for yourself and family. •Unlimited PTO Policy; we promote a flexible work environment and encourage our employees to maintain a healthy work/life balance. •11 Company paid Holidays Generous Parental Leave

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com