13 hours ago
Ansible
Apache
AWS
Azure
Cloud
Distributed Systems
ElasticSearch
Google Cloud Platform
Grafana
Java
Kafka
Kubernetes
Prometheus
Scala
Terraform
• Work closely with development and operations teams to build robust and efficient systems. • Manage & Monitor: Oversee the performance, reliability, and availability of large-scale Elasticsearch/OpenSearch clusters. • Optimize & Scale: Implement best practices for scaling, indexing, and querying to ensure optimal performance. • Automate & Streamline: Develop and maintain automated performance testing or benchmarking, monitoring, and alerting for Elasticsearch/OpenSearch clusters. • Troubleshoot & Resolve: Quickly identify and resolve issues related to cluster health, data integrity, performance bottlenecks, and search accuracy. • Collaborate: Work closely with development, DevOps, and other teams to design and implement enhancements to cluster architecture, stability, performance, and data management flows.
• Proven experience as an DBRE or in a similar role, with specific expertise in managing Elasticsearch or OpenSearch clusters. • Strong knowledge of Elasticsearch/OpenSearch architecture, including index management, sharding, and replication. • Experience with performance tuning, scaling, and cluster optimization. • Understanding of JVM concepts and ability to code with Java or Scala. • Familiarity with monitoring tools (e.g., Prometheus, Grafana). • Experience with configuration management and automation tools (e.g., Ansible, Terraform, Kubernetes). • Ability to diagnose and troubleshoot complex performance and stability issues in large-scale distributed systems. • Strong verbal and written communication skills to collaborate across teams and document processes clearly. • Familiarity with other distributed systems (e.g., Apache Solr, Kafka). • Knowledge of CI/CD pipelines and experience with DevOps practices. • Experience with cloud providers (AWS, Azure, GCP).
Apply Now