Talent500
Website:
talent500.co
Job details:
About HME:
For over 50 years, HME has created industry-leading products and services, their earliest being the first wireless microphone for the professional audio market in 1974. Since then, they have evolved greatly and pioneered into a variety of niche markets, setting new benchmarks with their work.
HME believes that there’s more to a person than what’s written on their resume. HME sees their employees for who they are and value every idea and opinion — it’s what fuels their innovative thinking and helps deliver market-leading products and services. As a part of our team at HME GCC, you will help HME leverage cutting-edge cloud technologies, and empower multiple industries to thrive by enabling seamless connectivity and enhancing communication.
HME is looking for a Senior Software Engineer to own and operate our OpenSearch-based Observability Platform. This role is responsible for running a highly available, secure, and scalable OpenSearch platform on Azure Kubernetes Service (AKS), acting as the DevOps and SRE owner and enabling application teams to onboard telemetry using Open Telemetry.
What you will do in the position:
- Own DevOps and SRE responsibilities for the OpenSearch observability platform.
- Design, deploy, and operate OpenSearch clusters on Kubernetes with focus on high availability, scalability, resiliency, and disaster recovery.
- Manage cluster sizing, shard strategy, index lifecycle management, retention, and performance tuning.
- Lead upgrades, capacity planning, and zero / minimal-downtime operations.
- Collaborate with application teams to integrate Open Telemetry-based logs, metrics, and traces into OpenSearch.
- Build and maintain monitoring, alerting, and dashboards for OpenSearch and underlying infrastructure.
- Own platform security including authentication, authorization, TLS, and secrets management.
- Troubleshoot production issues and automate operational workflows using IaC and CI/CD.
- Define SLIs, SLOs, and operational runbooks.
What you will need to succeed in this position:
- 4+ years of experience in software engineering, DevOps, or SRE roles.
- Hands-on experience with OpenSearch or Elasticsearch in production.
- Strong understanding of distributed systems and Kubernetes (AKS preferred).
- Experience with Azure cloud services and observability platforms.
- Hands-on experience with Open Telemetry and telemetry pipelines.
- Experience with monitoring, alerting, and security best practices.
- Familiarity with IaC tools (Terraform, Helm) and CI/CD pipelines.
- Working knowledge of the ELK stack is a plus
- Strong problem-solving and communication skills.
- Bachelor’s degree in computer science or a related field.
Click on Apply to know more.