UST
Website:
ust.com
Job details:
Role Description
Who we are:
At UST, we help the world’s best organizations grow and succeed through transformation. Bringing together the right talent, tools, and ideas, we work with our client to co-create lasting change. Together, with over 26,000 employees in 25 countries, we build for boundless impact—touching billions of lives in the process. Visit us at .
The Opportunity
Site Reliability Engineer
Bangalore, KA
Key Responsibilities
- Observability Strategy: Define and execute a full-stack observability roadmap aligned with business and IT goals, embedding AIOps and SRE principles.
- Monitoring Frameworks: Design and implement comprehensive monitoring solutions for applications, infrastructure, and networks to ensure continuous performance and availability.
- Data Analysis & Insights: Use AIOps-driven analytics to identify trends, predict failures, and automate corrective actions.
- Tool Ownership & Integration: Manage and optimize observability tools (Splunk, Datadog, Prometheus, Grafana, ThousandEyes, ServiceNow AIOps, etc.), integrating them across hybrid environments.
- Automation & Intelligence: Develop automated workflows for ing, incident detection, and root cause analysis using scripting and AI-driven approaches.
- Dashboarding & Reporting: Build intelligent dashboards and provide actionable insights to stakeholders on system health, incidents, and performance improvements.
- Incident & Problem Management: Partner with ITSM teams to enhance detection, triage, and resolution workflows with AI-assisted root cause analysis.
- Continuous Improvement: Stay updated with emerging observability and AIOps technologies, integrating them to enhance monitoring capabilities.
Qualifications
- 7 to 10 years in IT infrastructure, monitoring, and observability roles.
- Strong experience in AIOps platforms and applying AI/ML for monitoring, anomaly detection, and predictive analytics.
- Expertise with observability tools: Datadog, OpManager, Splunk, Dynatrace, AppDynamics, New Relic, Prometheus, Grafana, Nagios, etc.
- Familiarity with cloud-native monitoring across AWS, Azure, GCP, and on-premise data centers.
- Proficiency in scripting/automation (Python, Shell, PowerShell, Ansible).
- Experience with DevOps and cloud-native environments (Kubernetes, Docker, Terraform, CI/CD pipelines).
- Knowledge of database monitoring (SQL and NoSQL).
- Strong analytical and problem-solving skills for proactive detection and resolution.
- Excellent communication and collaboration skills to work across IT Ops, DevOps, Security, and Application teams.
- Experience presenting monitoring insights and observability metrics to executives and stakeholders.
- Solid foundation in networking and Linux administration.
- Experience with Atlassian tooling (Jira, Confluence) preferred.
- Certifications (ITIL, DevOps, AWS, Azure, GCP, Agile, PMP) are a plus
What We Believe
We’re proud to embrace the same values that have shaped UST since the beginning. Since day one, we’ve been building enduring relationships and a culture of integrity. And today, it's those same values that are inspiring us to encourage innovation from everyone to champion diversity and inclusion and to place people at the centre of everything we do.
Humility
We will listen, learn, be empathetic and help selflessly in our interactions with everyone.
Humanity
Through business, we will better the lives of those less fortunate than ourselves.
Integrity
We honor our commitments and act with responsibility in all our relationships.
Equal Employment Opportunity Statement
UST is an Equal Opportunity Employer. We believe that no one should be discriminated against because of their differences, such as age, disability, ethnicity, gender, gender identity and expression, religion, or sexual orientation.
All employment decisions shall be made without regard to age, race, creed, color, religion, sex, national origin, ancestry, disability status, veteran status, sexual orientation, gender identity or expression, genetic information, marital status, citizenship status or any other basis as protected by federal, state, or local law.
UST reserves the right to periodically redefine your roles and responsibilities based on the requirements of the organization and/or your performance.
- To support and promote the values of UST.
- Comply with all Company policies and procedures
Skills
observability,aiops,SRE,splunk,datadog,
Click on Apply to know more.