TRUGlobal
Website:
truglobal.com
Job details:
Senior Site Reliability Engineer
Experience: 5+ Years
Location: Bangalore
Role Overview
Looking for a Senior SRE – Tools & Telemetry with strong hands-on experience in observability platforms to act as a technical SME across our monitoring and telemetry stack. This role requires someone who can identify gaps, recommend improvements, and personally implement solutions across complex production environments.
Observability Stack
Dynatrace
Prometheus
Splunk
Grafana
Loki
OpManager
Key Responsibilities
- Serve as a hands-on SME for observability tools across applications and platforms.
- Design, implement, and improve metrics, logs, dashboards, alerts, and SLOs.
- Proactively identify observability gaps and drive improvements end-to-end.
- Partner with SRE, DevOps, and application teams to reduce MTTR and improve reliability.
- Optimize alerting, telemetry quality, and platform cost/performance.
Required Experience
- 5+ years in SRE, Observability roles.
- Strong hands-on experience with Dynatrace, Prometheus, Splunk, OpManager, Grafana, and/or Loki.
- Experience supporting production, business-critical systems.
- Proven ability to recommend and implement improvements, not just operate tools.
Click on Apply to know more.