Staff SRE (Site Reliability Engineer)

September 4

🇦🇷 Argentina – Remote

⏰ Full Time

🔴 Lead

👨🏻‍🔧 Site Reliability Engineer (SRE)

Apply Now
Logo of Gigster

Gigster

A fully-managed software development partner that builds teams to support emerging tech like AI and Blockchain.

App development • Web development • Rapid Prototyping • Product Management • software development

11 - 50

Description

• Work on cutting-edge projects with talented developers at Gigster. • Ensure reliability, scalability, and performance as a Staff Site Reliability Engineer. • Shape infrastructure for clients and improve overall service quality.

Requirements

• Design, build, and maintain scalable and reliable infrastructure. • Collaborate with engineering teams to ensure systems are designed with reliability and scalability in mind. • Evaluate and integrate new technologies to enhance our infrastructure. • Implement and maintain monitoring and alerting systems to detect and respond to issues promptly. • Lead incident response efforts, ensuring quick resolution and effective communication. • Conduct post-incident reviews and drive improvements based on findings. • Architect & Build innovative automation projects (preferably in Python/GoLang) from scratch to help reduce day-to-day SRE toil • Create Bash scripts to automate mundane manual activities like upgrades, status checks and deployment • Develop and maintain infrastructure as code (IaC) using tools such as Terraform, Ansible, or similar. • Automate repetitive tasks and processes to improve efficiency and reduce manual intervention. • Collaborate with cross-functional teams to deliver high-quality products and services. • Mentor and guide junior SREs and other team members. • Advocate for best practices in reliability engineering across the organization. • Drive initiatives to improve service reliability, capacity, and performance. • Participate in capacity planning and disaster recovery exercises. • Stay current with industry trends and emerging technologies. • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience). • 8+ years of minimum experience in the industry as a Software Engineer, SRE or Platform Engineer. • Minimum 3+ years of experience as a Platform Engineer or SRE. • Proven experience in managing large-scale, mission-critical infrastructure. • Deep understanding of Linux/Unix systems and networking. • Proficiency in at least one or more programming languages (e.g., Python, Go, Java). • Intermediate to Expert level skill in bash scripting • Experience with cloud platforms (AWS, Azure, GCP) and container orchestration (Docker, Kubernetes). • Strong knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack). • Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI). • Excellent problem-solving skills and a proactive attitude. • Strong communication and collaboration skills. • Ability to work independently and as part of a team. • Demonstrated leadership and mentoring abilities. • Candidates must be able to work during Pacific time hours 8am - 5pm PST, open to on-call rotation.

Benefits

• World-class network. Be part of a network with the most talented people in the world. • Amazing cutting-edge projects. Pick the projects from F500 companies that you’re interested in. • 100% remote and global. Live your best life, wherever that may be, and never lose out on career opportunities because of it. • Flexible work hours. There is a time to overlap with the customer’s timezone, but most of the time, we work asynchronously and don’t care when you’re online; you just deliver great results. • Flexible offerings. Choose how many hours you want to work and how much you want to earn. • Swag! Because who doesn’t love swag?

Apply Now
Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or lior@remoterocketship.com