Lera Technologies
Website:
lera.us
Job details:
Position: Senior DevOps Engineer - K8 and Platform Operations
Location: Hyderabad
Experience: 4-7 Years
About Us
Lera Technologies is a future-focused, AI-led digital transformation company that empowers businesses to innovate and grow in today’s fast-paced technology landscape. Our core strength lies in our flagship products like the 9X Data Platform, a state-of-the-art solution for seamless data ecosystem management, and FinSight 360, our advanced GenBI platform. We partner with enterprises across East Africa, West Africa, and Southeast Asia to solve complex challenges around data modernisation, integration, governance, and operational efficiency. At Lera, we don’t just enable transformation. We engineer it!
We are looking for a Senior DevOps Engineer who thrives in production environments. You have worked with Kubernetes in anger, you understand Kafka is not just a queue, and you care about keeping systems secure and stable under pressure. If you are structured, technically deep, and client-aware - we would love to connect.
Role Overview
This is a dedicated production support role on a live banking engagement - not a build role or a generalist DevOps position. You will own the Kubernetes environment running Lera’s 9X Data Platform for a major bank in Southeast Asia, be the first responder for incidents, and work directly with the client’s IT and Data teams. You will also lead the OpenShift to Kubernetes migration during the implementation phase. This is the right role if you are 4-7 years in, have genuine K8 production experience, and are ready to own a client environment end-to-end.
Your Role
As a Senior DevOps Engineer, you will
Kubernetes Operations
- Administer and operate the Kubernetes (K8) cluster in production: node health, pod lifecycle, resource optimisation, capacity planning, and cluster upgrades
- Manage namespaces, RBAC policies, network policies, and ingress controllers
- Maintain Helm charts and ensure version-controlled, repeatable environment configurations
- Lead the migration of the platform environment from OpenShift to Kubernetes during implementation phases M1 and M2
Kafka Operations
- Monitor and maintain Kafka clusters: topic management, consumer group health, partition balance, and performance tuning
- Investigate and resolve Kafka connectivity failures, lag issues, and replication anomalies
- Ensure Kafka integration stability across all platform microservices within the Kubernetes environment
DevSecOps
- Integrate security scanning (SAST/DAST) into CI/CD pipelines and enforce secure build practices
- Manage secrets using Vault or Kubernetes Sealed Secrets; ensure no hardcoded credentials in any environment
- Enforce container image security: vulnerability scanning, base image policies, and registry controls
- Implement and maintain network policies, pod security standards, and RBAC across the cluster
- Maintain compliance with client security policies and assist in security audits and reviews
Production Support and Incident Management
- Act as first responder for all production incidents (P1-P4): triage, root cause analysis, fix coordination, and controlled restarts and reruns
- Provide active monitoring and intervention during COB/batch cycles; investigate failures and validate reconciliation
- Deploy approved fixes, patches, minor releases, and configuration changes to production using CI/CD pipelines
- Maintain 24x7 P1 on-call availability on a shared rotation with the second support engineer
- Conduct proactive maintenance: log reviews, performance trend analysis, pre-emptive alerting
- Liaise with client IT, Data, and application teams; escalate to Lera L3 engineering when required
- Author and maintain operational runbooks, maintenance procedures, and incident resolution logs
- Participate in weekly and bi-weekly service reviews and contribute to monthly SLA reporting
What You Bring
- 4-7 years in DevOps or Platform Engineering with at least 2 years of Kubernetes production operations experience
- You have owned a production K8 environment - not just deployed to one
- Hands-on Kafka experience in production: you have debugged consumer lag, rebalanced partitions, and dealt with broker failures
- Practical DevSecOps experience: secrets management, container security, RBAC - not just awareness
- Experience with CI/CD pipelines and Infrastructure as Code (Helm, Terraform, or Ansible)
- Comfortable working in a client-facing support model with defined SLAs and on-call responsibilities
- Experience with OpenShift or migration from OpenShift to Kubernetes is an advantage
- Banking or financial services environment experience is an advantage given the production sensitivity of this role
Technical Skills
- Kubernetes (K8): cluster admin, pod lifecycle, RBAC, Helm, ingress, network policies, upgrades - Expert level required
- Kafka: cluster operations, topic management, consumer groups, performance tuning, failure recovery - Proficient required
- DevSecOps: SAST/DAST, secrets management (Vault/Sealed Secrets), image security, RBAC, network policies - Proficient required
- CI/CD: Jenkins, GitLab CI, ArgoCD, or equivalent - Proficient required
- Infrastructure as Code: Terraform, Helm, Ansible, or equivalent - Proficient required
- Monitoring and observability: Prometheus, Grafana, ELK or EFK stack, or equivalent - Proficient required
- Scripting: Python, Bash, or Shell - Proficient required
- Cloud platforms: AWS, GCP, or Azure (any one) - Working knowledge
- OpenShift: administration and migration experience - Working knowledge
- Networking: DNS, load balancing, ingress controllers, service mesh (Istio preferred) - Working knowledge
Desired Qualifications
- Bachelor’s degree in Engineering, Computer Science, Information Technology, or equivalent
- Certified Kubernetes Administrator (CKA) is preferred
- Certified Kubernetes Security Specialist (CKS) is an advantage
- AWS, GCP, or Azure DevOps or Cloud Engineer certification is an advantage
- Confluent Certified Operator for Apache Kafka is an advantage
What We Expect From Day One
- Know your environment before your first on-call shift - read the runbooks, understand the architecture, ask questions early
- Log every incident properly from day one - no undocumented fixes, no verbal-only resolutions
- Raise alerts early - if something looks wrong, flag it before it becomes a P1
- Do not make production changes without a change record - follow the change management process from day one
- Own your on-call rotation - reliability and response time matter in a banking production environment
- Communicate clearly with the client team - timely, factual updates during incidents, no surprises
Why Choose Lera?
- I.C.E. Philosophy: Embrace Innovation, Creativity, and Experimentation.
- Impact: Play a critical role in keeping mission-critical banking data platforms stable and secure across Lera’s client portfolio in Africa and Asia.
- Culture: Thrive in a workplace that values diversity and inclusive excellence.
- Growth: Work at the intersection of platform engineering, security, and AI-led transformation across emerging markets.
- Professional Growth: Benefit from extensive opportunities for career advancement within Lera’s expanding delivery and engineering practice.
Join Us
If you are ready to take ownership of a production Kubernetes environment for a leading bank and grow your career in platform engineering and DevSecOps, please apply with your resume and a brief note on a production incident you resolved and what you learned from it.
Click on Apply to know more.