Technology • Payment Systems • Alternative Payments • Risk Management • Currency Exchange
5001 - 10000
2 days ago
Technology • Payment Systems • Alternative Payments • Risk Management • Currency Exchange
5001 - 10000
• Join a team of system administrators and engineers responsible for designing, implementing and maintaining System and Cloud Observability & Log Management solutions. • Implement and manage observability tools such as Splunk, Zabbix, Dynatrace, Datadog. • Set up and configure dashboards, alerts, and reports that provide visibility into system health, performance, and availability. • Develop and maintain centralized logging solutions to ensure comprehensive logging coverage, log retention, and log security. • Work with IT, DevOps, and product teams to define KPIs and SLOs for critical systems and applications. • Provide support in monitoring and troubleshooting production systems. • Assist in automating monitoring tasks and creating self-healing scripts. • Analyze logs and telemetry data for insights on incident detection and performance optimization. • Participate in on-call rotations and collaborate with security teams to ensure log management solutions support security monitoring.
• At least 5 years of experience in IT Operations, with a focus on monitoring, observability, and log management. • Solid understanding of Open Telemetry (OTEL) based monitoring and observability concepts, including metrics, logs, traces, and alerts. • Hands-on experience with observability and monitoring tools (e.g., Splunk Observability, Zabbix, Datadog, Dynatrace, Prometheus, Grafana, New Relic). • Strong understanding of log management best practices, including centralized logging, data retention, and privacy requirements. • Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and managing cloud-based monitoring solutions. • Experience in designing and implementing system health dashboards, alerting mechanisms, and automated incident response processes. • Strong problem-solving skills and the ability to work under pressure in a fast-paced environment. • Basic scripting skills (e.g., Python, Bash) for task automation and custom monitoring solutions. • Excellent communication and collaboration skills, with the ability to work with cross-functional teams. • Added bonus if you have: • A bachelor's degree or greater in computer science, information technology, or a related field. • Practical experience in the role can be used in place of formal education. • Knowledge of ITIL or similar frameworks for incident and problem management. • Exposure to DevOps principles and experience with CI/CD pipelines. • Experience in container monitoring (e.g., Kubernetes, Docker) and cloud-native architectures. • Certifications: Technical certifications in cloud and virtualization technologies are highly valued. • Any certifications for AWS, Azure, MSCE, RH or VMware Certified Professional (VCP), VMware Certified Advanced Professional (VCAP), and Citrix Certified Associate - Virtualization (CCA-V), Datadog, Dynatrace, Splunk or other observability tools.
• A competitive salary and benefits. • Time to support charities and give back to your community. • Parental leave policy. • Global recognition platform. • Virgin Pulse access. • Global employee assistance program.
Apply Now3 days ago
10,000+
Bridge market intelligence and regional strategy implementation at Agilent's Business Intelligence Team.
🇺🇸 United States – Remote
💵 $127.7k - $235.4k / year
💰 Post-IPO Debt on 2019-09
⏰ Full Time
🟡 Mid-level
🟠 Senior
🧐 Analyst
3 days ago
10,000+
Cigna improves health outcomes through case management of Medicare members.
3 days ago
10,000+
Lead System Analyst supporting USMC training with analysis and AAR product creation.
🇺🇸 United States – Remote
💵 $75.8k - $94.8k / year
💰 $71k Grant on 2014-09
⏰ Full Time
🟠 Senior
🧐 Analyst
3 days ago
11 - 50
Provide support for healthcare AI application users at SmarterDx.
🇺🇸 United States – Remote
💵 $70k - $100k / year
💰 $6M Seed Round on 2022-04
⏰ Full Time
🟠 Senior
🧐 Analyst