Overview
Company name: SentiLink | HQ Location: San Francisco, California | Website | LinkedIn
Role: Software Engineer Observability SRE
- Experience: 5-10 years
- Salary: Rs. 40-80 lakhs per annum
- Location: Gurgaon
- Type: Full-time
About the Role
We’re currently looking for a Software Engineer specializing in Observability & Site Reliability Engineering (SRE) to join our dynamic team in Gurgaon. In this role, you will play a critical part in ensuring the performance, reliability, and scalability of our platform. You’ll work closely with engineering, DevOps, and infrastructure teams to build tools and systems that improve visibility, incident response, and overall system health.
Key Responsibilities
- Build and maintain observability tools including logging, metrics, tracing, and alerting systems
- Implement and refine SLOs, SLIs, and error budgets to guide reliability efforts
- Develop automation to improve system reliability, reduce manual work, and enhance incident response
- Collaborate with cross-functional teams to ensure monitoring and alerting are integrated from day one
- Participate in on-call rotations and root cause analyses to drive long-term improvements
What We’re Looking For
- 5+ years of experience in Software Engineering or SRE roles
- Strong programming/scripting skills (Python, Go, Bash, or similar)
- Hands-on experience with observability tools such as Prometheus, Grafana, ELK, Datadog, or similar
- Good understanding of Linux systems, networking, and cloud infrastructure (AWS/GCP/Azure)
- Familiarity with Kubernetes, Docker, and CI/CD pipelines
- Strong debugging, analytical, and problem-solving skills