Site Reliability Engineer

Salary

₹8 - 15 LPA

Min Experience

2 years

Location

remote

Job Type

full-time

About the job

This job is sourced from a job board.

Overview

About the role

We are looking for a talented and experienced SRE to join our team. You'll help scale our operations, design and maintain robust infrastructure, and implement best practices for reliability and efficiency in our cloud-native environment.

Responsibilities

Manage and maintain Kubernetes clusters (on-prem and cloud: OpenShift, EKS, AKS, and GKE).

Implement and manage CI/CD pipelines using tools like GitHub Actions, Argo CD, or GitLab.

Design and maintain observability pipelines with tools like Prometheus, Grafana, OpenTelemetry, and others.

Optimize system performance and troubleshoot production issues.

Implement SRE concepts, including SLIs and SLOs, to ensure system reliability.

Automate infrastructure and operational tasks using programming languages like Golang or Python and IaC tools like Terraform or Pulumi.

Stay updated on emerging trends like AI, MLOps, and Edge Computing.

Share knowledge via technical writing and speaking engagements.

Qualifications

Bachelor's degree in Computer Science, IT, or a related field.

2+ years of experience in SRE or DevOps roles.

Strong experience with Kubernetes and cloud platforms (AWS, Azure, GCP).

Proficiency in programming (Python, Golang, or Node.js).

Familiarity with CI/CD tools and modern deployment strategies.

Knowledge of observability tools and infrastructure as code.

Excellent problem-solving and communication skills.

Skills

kubernetes

python

golang

node.js

ci/cd

observability

infrastructure as code