Landmark Group
Website:
landmarkgroup.com
Job details:
Experience: 7–13 Years
Role Overview:
We are looking for a Lead Site Reliability Engineer (Application SRE) to improve application reliability, monitoring, and troubleshooting for large-scale systems. The ideal candidate should have strong experience working on the application side of SRE, focusing on performance, stability, and incident resolution.
Key Responsibilities:
- Ensure reliability, performance, and availability of critical applications.
- Design and implement effective application monitoring and alerting.
- Troubleshoot complex application issues in distributed systems.
- Work closely with development teams to improve system stability and performance.
Key Skills:
• Strong experience in Application SRE practices
• Hands-on with Java, Microservices, and Kafka
• Experience in application monitoring, reliability, and troubleshooting at an L3 level.
• Strong ability to write complex SQL queries
• Strong knowledge on Kubernetes
Good to Have:
• Team management or mentoring experience
NOTE📌- This role is focused on Application SRE. Candidates primarily from DevOps or infrastructure-only backgrounds may not be the right fit.
Click on Apply to know more.