bytespoke (Arrixa)
Website:
bytespoke.com
Job details:
Job Summary
We are looking for 8+ years experienced OpenTelemetry (OTel) Operations Engineer to manage and maintain observability solutions across applications and infrastructure. The role involves implementing, monitoring, and optimizing telemetry data (metrics, logs, and traces) using OpenTelemetry to ensure system reliability, performance, and operational visibility.
Key Responsibilities
- Implement and manage OpenTelemetry (OTel) instrumentation for applications and services.
- Configure and maintain metrics, logs, and distributed tracing pipelines.
- Monitor system performance and troubleshoot issues using observability tools.
- Integrate OpenTelemetry with monitoring platforms such as Grafana, Prometheus, or similar tools.
- Ensure telemetry data collection is optimized for performance and cost.
- Collaborate with DevOps, SRE, and development teams to improve system observability.
- Manage alerts, dashboards, and incident analysis based on telemetry insights.
- Support production environments and ensure high availability of monitoring systems.
Required Skills
- Strong 8+ years' experience with OpenTelemetry (OTel) frameworks and collectors.
- Knowledge of observability concepts: metrics, logs, traces.
- Experience with monitoring tools such as Prometheus, Grafana, Datadog, or ELK stack.
- Familiarity with Kubernetes, Docker, and cloud platforms (AWS/Azure/GCP).
- Experience with CI/CD pipelines and DevOps practices.
- Basic scripting knowledge (Python, Bash, or similar).
Preferred Qualifications
- Experience in SRE or DevOps operations roles.
- Understanding of microservices architecture.
- Exposure to distributed systems monitoring.
Click on Apply to know more.