Senior Site Reliability Engineer

6 days ago

Apply Now
Logo of Veracode

Veracode

Application Security β€’ Web Application Security β€’ Binary Static Analysis β€’ Vendor Application Security Testing β€’ Runtime Application Self Protection

501 - 1000

πŸ’° Private Equity Round on 2022-03

Description

β€’ Utilize AWS services to design scalable cloud solutions that support critical systems. β€’ Partner with engineering teams to ensure monitoring and alerting is in place, enabling consistent, scalable, and automated service delivery. β€’ Develop and improve monitoring and alerting solutions to guarantee the reliability of applications and services, using tools like Datadog and Sumologic. β€’ Lead efforts to automate infrastructure deployment and management using Terraform, Kubernetes, and other cloud-native tools. β€’ Create automated incident response workflows to handle common infrastructure and application issues. β€’ Collaborate with security teams to ensure systems adhere to industry-standard security practices and policies. β€’ Document and train engineering teams on best practices in reliability, scalability, and operational excellence. β€’ Participate in 24x7 on-call rotations to respond to incidents, triage production issues. β€’ Contribute to incident and process post-mortems. β€’ Ensure uptime, SLAs, and availability of critical platform components through process improvements and automation. β€’ Monitor existing application and infrastructure while working to improve existing monitoring. β€’ Communicate effectively with project stakeholders and management. β€’ Develop and support processes to maintain uptime, SLAs and availability of critical platform components. β€’ Troubleshoot and resolve production issues related to systems, network, and application. β€’ Ensure that our systems and processes adhere to industry-standard security practices and policies.

Requirements

β€’ Bachelor's Degree in Computer Science, Information Science, Engineering, or related/relevant field or equivalent experience. β€’ 5+ years working in a SRE, DevOps, Cloud Engineering or similar role. β€’ Experience with AWS and automation tools like Terraform, CloudFormation, or Ansible. β€’ Hands-on experience deploying, managing, and troubleshooting Kubernetes clusters. β€’ Proficiency with observability, monitoring, and alerting tools (Datadog, Sumologic, Prometheus, Grafana, etc.). β€’ Familiarity with CI/CD pipelines and repository management tools (e.g., GitLab, Jenkins, GitHub). β€’ Strong programming skills for automation (Python, Go, or similar languages). β€’ Solid understanding of infrastructure as code (IaC) and GitOps methodologies. β€’ Strong communication skills with the ability to collaborate effectively across different teams. β€’ Ability to work in an Agile environment. β€’ Proven experience in troubleshooting production environments and improving system reliability. β€’ Experience with on-call/incident management systems such as PagerDuty, VictorOps or OpsGenie.

Apply Now

Similar Jobs

6 days ago

Join Ghost Story Games as a Senior Build & Release Engineer. Maintain and improve build/release pipeline for game development.

6 days ago

As a Senior Engineering Manager, oversee DevOps for Code and Theory's applications. Collaborate and ensure delivery of scalable and secure solutions.

πŸ‡ΊπŸ‡Έ United States – Remote

πŸ’΅ $175k - $190k / year

⏰ Full Time

🟠 Senior

β›‘ DevOps & Site Reliability Engineer (SRE)

πŸ—½ H1B Visa Sponsor

6 days ago

Join the Federal Reserve as a DevOps Engineer to design and implement automation solutions for FedNow. Transform the U.S. payments landscape through innovative problem-solving and automation.

6 days ago

Leidos

10,000+

Leidos seeks a Resource Manager for the Navy's largest IT services program. Oversee SRE team management and infrastructure for operational efficiency.

November 27

Lead DevOps Engineer at HALO overseeing AWS and Azure cloud practices for robust services.

πŸ‡ΊπŸ‡Έ United States – Remote

πŸ’΅ $100k - $130k / year

πŸ’° Private Equity Round on 2016-01

⏰ Full Time

🟠 Senior

β›‘ DevOps & Site Reliability Engineer (SRE)

πŸ—½ H1B Visa Sponsor

Built byΒ Lior Neu-ner. I'd love to hear your feedback β€” Get in touch via DM or lior@remoterocketship.com