Flag job

Report

Junior SRE

Min Experience

1 years

Location

Pune, Bengaluru

JobType

Full Time, Permanent

About the job

Info This job is sourced from a job board

About the role

We are seeking a skilled Site Reliability Engineer (SRE) to administer and optimize our cluster-based services, ensuring high availability, data protection, and system reliability. The ideal candidate will leverage SRE principles to develop scalable system architectures, conduct in-depth log analysis, and troubleshoot complex technical issues while providing exceptional customer support. Key Responsibilities Administer and optimize cluster-based services, focusing on data protection and system reliability. Implement and manage monitoring tools, configuring alerts to proactively detect and resolve cluster anomalies. Utilize SRE principles to design and maintain scalable, highly reliable system architectures. Conduct thorough log analysis and debugging to identify and resolve system issues, with a deep understanding of Linux environments. Address storage, system, and network issues, applying best practices to minimize downtime and data loss. Provide remote expert troubleshooting support to customers, guiding them through complex technical challenges. Educate customers on system maintenance and data protection strategies to enhance self-reliance and prevent future issues. Stay updated on industry trends, emerging technologies, and SRE methodologies to drive continuous operational improvements. Required Qualifications Proven experience in System Administration with a strong focus on data protection, cluster-based service management, and SRE practices. Deep knowledge of Linux, system architecture, and network infrastructure. Proficiency in scripting and automation tools to streamline system operations. Strong ability to conduct sophisticated log analysis and debugging. Excellent communication skills, with the ability to convey technical concepts effectively to non-technical audiences. Preferred Qualifications(if any) Experience with cloud platforms and containerization technologies. Familiarity with CI/CD pipelines and DevOps methodologies. Certifications in relevant fields (e.g., Linux, Kubernetes, AWS, Google Cloud)

Skills

Linux
container
continuous integration
kubernetes
network infrastructure
data protection
sre
ci/cd
linux internals
site reliability engineering
containerization
gcp
system architecture
devops
debugging
clustering
aws
ci cd pipeline