Observability Engineer - SRE

April 6


The Baldwin Group

The Baldwin Group is a comprehensive risk management, insurance, employee benefits, and wealth management advisor. The firm provides tailored solutions for businesses and individuals, helping clients navigate the complex landscape of insurance and financial protection. With expertise in business insurance, personal insurance, and employee benefits, The Baldwin Group aims to streamline solutions, adapt strategies to changing needs, and mitigate industry-specific challenges. The company also offers private client services and wealth management, reflecting its dedication to protecting what matters to clients. The Baldwin Group is an official insurance broker for the Tampa Bay Buccaneers.

Commercial Risk Management • Private Risk Management • Personal Insurance • Employee Benefits • Asset and Income Protection

1001 - 5000 employees

Founded 2011

💸 Finance

📋 Description

• The Baldwin Group is seeking an Observability/SRE Engineer to enhance infrastructure reliability.
• Contribute significantly to Observability, APM, Monitoring, and Logging strategy.
• Develop tools for monitoring application performance and operational efficiency.
• Collaborate with software engineers to integrate observability best practices into development.
• Lead efforts to ensure incident response plans minimize downtime and improve procedures.

🎯 Requirements

• 3+ years of experience in an Observability or Site Reliability Engineer role.
• Experience with cloud infrastructure platforms such as AWS or Azure.
• Proven experience administering observability and monitoring tools (Datadog or similar).
• Experience with containerized and serverless compute technology (Docker, ECS, Kubernetes, Lambda, etc.).
• Experience with DevOps and CI/CD processes and tools (GitHub, Terraform, Ansible, etc.).
• Experience integrating DevOps, SRE, and testing tools to generate DORA metrics and reports and to create dashboards.
• Understanding of SRE principles, including SLOs, SLIs, KPIs, metrics, logging, and tracing.
• Proficient in writing scripts (Bash, PowerShell) and programming in one or more languages (Python, JavaScript, Go, Java, or similar).
• Experience in capacity planning and scaling resource requirements based on traffic patterns and performance metrics.
• Experience in preparing, executing, and improving incident response plans.
• Strong understanding of on-call rotation practices and incident escalation processes.
• Knowledge of security best practices and compliance standards relevant to observability and monitoring (e.g., GDPR, HIPAA).
• Datadog or relevant certifications preferred.
• Highly self-motivated, highly available, and driven to exceed colleague expectations.
• Ability to think critically and logically under pressure.
• Strong technical experience with a proven history of troubleshooting complex, cross-segment, cross-office, and cross-team problems.

🏖️ Benefits

• Unknown Benefits
