Senior Engineer in Site Reliability Engineering

Min Experience

3 years

Location

Bangalore

Job Type

full-time

About the job

This job is sourced from a job board.

About the role

Lead incident management processes to ensure high service availability and timely resolution of incidents.
Proficiency in Grafana and Prometheus for monitoring system performance and configuring alerting mechanisms.
Strong capability in incident triaging and root cause analysis to swiftly identify and resolve application issues.
Collaboration with cross-functional teams to enhance system reliability and performance through automation and optimization strategies.
Utilization of SQL for troubleshooting and data analysis.
Preferred skills: proficiency in Python programming, experience with Google Apps Script, and a background in automation processes.
Required qualifications: Bachelor of Technology in Computer Science Engineering or Master of Technology in Information Technology or Software Engineering.
Preferred certifications: Certified Kubernetes Administrator and AWS Certified DevOps Engineer – Professional.

About the company

Altimetrik delivers outcomes for our clients by rapidly enabling digital business & culture and infusing speed and agility into enterprise technology and connected solutions. We are practitioners of end-to-end business and technology transformation. We tap into an organization's technology, people, and assets to fuel fast, meaningful results for global enterprise customers across financial services, payments, retail, automotive, healthcare, manufacturing, and other industries. Founded in 2012 and with offices across the globe, Altimetrik makes industries, leaders and Fortune 500 companies more agile, empowered and successful.

Altimetrik helps companies get "unstuck". We're a technology company that gives organizations a process and context to solve problems in unconventional ways. We're a catalyst for an organization's talent and technology, helping teams push boundaries and challenge traditional approaches. We make delivery bolder, more efficient, more collaborative and even more enjoyable.

Skills

incident management
grafana
prometheus
alert triaging
alert configuration
datadog
debugging and troubleshooting
python
automation
sql