AgileEngine
Website: agileengine.com
Job details:
🔥 Senior Software Engineer / SRE – Observability Platform | India
🚀 Immediate Hiring | Fast-Track Interviews | Work on Large-Scale Distributed Systems
We’re hiring a Senior Software Engineer / SRE to join a high-impact engineering team at Indeed. This role is ideal for engineers passionate about backend systems, Kubernetes, cloud infrastructure, monitoring, and observability platforms.
📍 Location: India
---
💥 Why You Should Apply:
• Work on scalable distributed systems powering enterprise-grade platforms
• Own observability, monitoring, reliability, and platform engineering initiatives
• Exposure to Kubernetes-based microservices environments
• Opportunity to work with Datadog, Prometheus, Grafana, and cloud-native tooling
• Collaborate with global engineering teams on reliability and performance improvements
• Strong ownership with real operational impact
---
✅ What We Need (Non-Negotiable):
• 4+ years of experience with Java, Python, or Node.js
• Strong hands-on experience with Kubernetes environments
• Experience with Datadog or similar tools like Prometheus and Grafana
• Ability to configure dashboards, alerts, logging, metrics, and APM tracing
• Experience monitoring containerized and microservices architectures
• Strong AWS experience
• Experience integrating observability tools into cloud environments
• Experience with CI/CD integrations for observability platforms
• Strong scripting and automation skills
• Hands-on experience with API integrations
• Upper-intermediate English communication skills
---
💼 What You’ll Do:
• Design, build, and maintain scalable backend and platform components
• Implement observability solutions across distributed systems
• Configure dashboards, alerts, metrics, logs, and tracing solutions
• Improve system reliability, scalability, and performance
• Deploy and operate services in Kubernetes environments
• Integrate monitoring tools into CI/CD pipelines and cloud infrastructure
• Automate operational and monitoring workflows using scripting
• Provide operational and training support for observability platforms, especially Datadog
• Collaborate with engineering teams to improve visibility and reliability practices
---
➕ Bonus (Good to Have):
• Experience owning internal engineering or observability platforms
• Strong ownership mindset around reliability and scalability
• Experience installing and configuring Datadog agents and integrations
• Experience managing API keys, user roles, and secure configurations
• Familiarity with Go (Golang)
• Exposure to tools like New Relic, Dynatrace, Elastic Stack, or Splunk
---
⚡ Priority Given To Candidates Who:
• Have strong hands-on Kubernetes + Observability platform experience
• Have worked on large-scale distributed or microservices-based systems
• Demonstrate strong ownership and operational excellence
• Can proactively drive platform improvements and reliability initiatives
---
🚨 Serious applicants only. Profiles without strong Kubernetes + Observability experience will not be considered.
Please read the critical requirements below carefully. We can proceed only if you meet all of the listed criteria.
• This is primarily a Software Engineering role (60–70%) with SRE/Observability responsibilities, and it requires hands-on coding in Python, Node.js, or Java. Do you have recent hands-on experience building backend services in production?
• Do you have 4+ years of hands-on experience with Java, Python, or Node.js in production environments?
• Do you have strong real-world Kubernetes experience managing containerized workloads?
• Do you have hands-on experience with Datadog, Prometheus, Grafana, or similar observability tools?
• Have you configured dashboards, alerts, metrics, logging, and APM tracing in production systems?
• Do you have experience monitoring distributed microservices architectures?
• Do you have hands-on AWS cloud experience?
• Have you integrated observability tools into CI/CD pipelines and cloud infrastructure?
• Do you have scripting and automation experience for operational workflows?
• Do you have experience with API integrations?
• Are you comfortable supporting and improving platform reliability, scalability, and performance?
• Do you have good English communication skills for global collaboration?
• Can you join within a short notice period?
---
📩 To Apply, Share:
1. Email ID
2. Years of Experience
3. Current CTC / Expected CTC
4. Notice Period
5. Current Location
Click Apply to learn more.