Flag job

Report

DevOps SDE1_AP241125

Min Experience

3 years

Location

Bengaluru

JobType

full-time

About the job

Info This job is sourced from a job board

About the role

Job Title: Senior engineer (SDE-2)– Platform Engineering Location: Bengaluru Employment Type: Full-time Team: Platform Engineering About the Role:We are looking for a passionate and hands-on DevOps Engineer to join our Platform Engineering team and accelerate our platform modernization journey. This role is ideal for engineers who thrive in automation-heavy environments and want to contribute to the evolution of scalable, secure and intelligent infrastructure platforms. You will work on next-gencloud-native platforms, implement AI-powered automation and help build self-healing, resilient systems that support our applications at scale. Key Responsibilities● Manage platform modernization initiatives including containerization, service mesh adoption and migration to microservices and serverless infrastructure.● Design and implement robust CI/CD pipelines and self-service DevOps platforms to streamline software delivery across environments.● Develop and manage Infrastructure as a Code (IaaC) using tools like Terraform or CloudFormation for scalable and repeatable deployments.● Automate infrastructure provisioning, configuration management and operations using tools like Ansible, Chef or Puppet.● Leverage AI/ML-driven automation for predictive alerting, anomaly detection, auto-scaling and intelligent incident response.● Build intelligent, self-healing systems focused on resilience, cost optimization and performance forecasting in public cloud environments.● Drive observability by establishing standards for monitoring, logging and alerting using tools such as Prometheus, Grafana, OpenTelemetry and AIOps platforms.● Embed security and compliance into DevOps workflows by adopting DevSecOps practices throughout the software development lifecycle.● Evaluate emerging technologies and methodologies to improve system reliability, developer experience and platform scalability.● Participate in SRE on-call rotations, production support and post-incident reviews to continuously improve system resilience.● Build internal tools and automation solutions to enhance platform observability and operational efficiency.● Identify and resolve performance bottlenecks and lead root cause analysis efforts for critical incidents. ● Collaborate cross-functionally with engineering, architecture and security teams to drive best practices and architectural alignment.● Support disaster recovery planning, backup strategy implementation and compliance initiatives (e.g., SOC2, ISO).● Mentor junior engineers, promote knowledge sharing and foster a culture of engineering excellence. Qualifications● 3–6 years of experience in DevOps, SRE or platform engineering roles with a software engineering mindset.● Hands-on expertise in Kubernetes, Docker and service mesh architectures (Istio, Linkerd).● Expertise in CI/CD tools such as Jenkins, ArgoCD, Spinnaker or similar tools for automating and managing deployment workflows.● Experience with observability stacks (Prometheus, Grafana, ELK, Loki or Datadog) for monitoring, logging and alerting.● Good understanding of AIOps and ML-driven automation, including anomaly detection, intelligent alerting and predictive incident response● Strong problem-solving and debugging skills, particularly in complex, production-grade distributed systems● Expertise in Infrastructure as a Code (IaaC) using tools like Terraform or Pulumi and proficiency in configuration management with Ansible, Puppet or similar tools.● Familiarity with event-driven architectures using tools like Kafka or cloud-native pub/sub messaging systems.● Good understanding of cloud cost optimization and efficiency practices through automation and resource management.● Experience integrating security scanning and compliance checks into CI/CD pipelines using tools like Trivy, Snyk or arnica.

Skills

kubernetes
docker
istio
linkerd
jenkins
argocd
spinnaker
prometheus
grafana
elk
loki
datadog
terraform
pulumi
ansible
puppet
kafka