Tekgence Inc
Website:
tekgence.com
Job details:
Role: Global Head of Service Reliability (Senior Recovery Lead)
Location: Pune / Hyderabad (Hybrid)
Duration: 12 Months (Contract to Hire)
Role Summary:
Lead global service reliability and incident recovery by managing major incidents, reducing recovery time, and driving long-term resilience through SRE, automation, and engineering best practices.
Key Responsibilities:
- Lead global incident recovery and act as technical escalation during major outages
- Reduce TTR through effective coordination and root cause analysis
- Drive reliability strategy including automation, self-healing, and resilience engineering
- Partner with SRE, Platform, and Problem Management teams to prevent recurring issues
- Conduct chaos testing, recovery simulations, and scenario planning
- Build and lead a high-performing global team
Requirements:
- 12+ years in SRE, DevOps, Infrastructure, or Technical Operations
- Strong experience in incident management, RCA, and system reliability
- Proven leadership of global teams in high-scale environments
- Expertise in automation, cloud, and resilience engineering
- Excellent stakeholder and communication skills
Click on Apply to know more.