Luxoft
Website:
luxoft.com
Job details:
Project description
Monitoring as a Platform is a platform that is the first critical step towards a self-managed infrastructure, and includes capabilities like real-time monitoring, intelligent networks, self-healing, and IoT to achieve improved productivity, organizational agility, and improved employee experiences.
Responsibilities
- Cloud Architecture & Infrastructure as Code (IaC)
- Lead the design and implementation of highly available, multi-region AWS architectures with a primary focus on EKS (Elastic Kubernetes Service).
- Extensive Terraform Tooling: Develop, maintain, and version-control modular Terraform templates to manage complex cloud resources, ensuring 100% of infrastructure is codified and reproducible.
- Configuration Management: Utilize Ansible Playbooks for OS-level hardening, application configuration, and hybrid-cloud task automation.
- Kubernetes Orchestration & Security
- K8s Lifecycle Management: Manage the full lifecycle of EKS clusters, including upgrades, node group optimization, and cost management.
- Security & Governance: Implement and enforce Kubernetes security best practices, including Service Accounts (IRSA), Network Policies, RBAC, and integrated Secrets Management (e.g., HashiCorp Vault or AWS Secrets Manager).
- Containerization: Lead the effort to containerize complex legacy applications and optimize configuration patterns within Kubernetes.
- CI/CD & SRE Automation
- GitHub Actions Excellence: Design and optimize high-speed GitHub Actions workflows for automated testing, security scanning, and seamless deployments.
- SRE Scripting: Develop advanced automation scripts (Python, Go, or Bash) to eliminate "toil," automate self-healing, and perform capacity planning.
- Observability & Monitoring: Build and maintain comprehensive Grafana dashboards to monitor Pod/Service performance. Deploy and configure Beats agents (Filebeat, Metricbeat) as DaemonSets to ensure deep visibility into container logs and system metrics.
Mandatory skills
- 8+ years of professional experience in Cloud Engineering/DevOps, with at least 5 years of focused Kubernetes administration.
- Cloud Mastery: Expert-level knowledge of AWS (VPC, IAM, EKS, RDS, Route53, S3, Gateway API, Lambda).
- IaC: Expert in Terraform (Reusable modules, state management).
- Automation: Strong proficiency in Ansible and SRE-focused scripting.
- CI/CD: Deep experience with GitHub Actions.
- Container Ecosystem: Expert knowledge of Docker, K8s networking, and the ELK/Beats stack.
- Monitoring: Mastery of Grafana and Prometheus for performance tuning.
- Certification: (Highly Preferred) CKA (Certified Kubernetes Administrator) or AWS Certified Solutions Architect
- Professional.
- Scripting Languages: Python, Shell Script
- Architectural Vision: Ability to translate business requirements into scalable technical blueprints.
- Mentorship: Proven track record of guiding junior and senior engineers through complex technical hurdles.
- Incident Leadership: Experience leading Root Cause Analysis (RCA) and post-mortem discussions to improve system resilience.
Click on Apply to know more.