- Location: Hyderabad, Telangana, India
- Job type: Full-time
Required skills
- Amazon Web Services
- Azure
- CentOS
- CI
- cloud infrastructure
- compliance
- Datadog
- DevOps
- Docker
- Git
- Jenkins
- Kubernetes
- Linux
- NAT
- Root Cause Analysis
- state management
- Terraform
- Ubuntu
- uptime
- Vault
- VPC
- web services
About the role
algoleap
Website: algoleap.com
Job details:
DevOps Engineer - JD
- Architect, design, and implement highly available, scalable, and fault-tolerant cloud infrastructure on Amazon Web Services (AWS) and Microsoft Azure.
- Design multi-account strategies, landing zones, RBAC models, governance frameworks, and cost optimization (FinOps) practices.
- Lead cloud migration initiatives from on‑premises to cloud, including re-hosting, re-platforming, and re‑architecting.
- Design and implement disaster recovery (DR) and multi-region high availability architectures.
- Build, standardize, and optimize enterprise-scale CI/CD pipelines using Jenkins and Azure DevOps.
- Implement Git branching strategies, pull request workflows, tagging strategies, and release governance.
- Integrate DevSecOps practices including automated code scanning, container image scanning, and policy enforcement within pipelines.
- Design and manage production‑grade Kubernetes clusters with autoscaling, RBAC, network policies, and workload optimization.
- Manage container lifecycle using Docker, including private registry management and image security best practices.
- Administer and manage artifact repositories using JFrog Artifactory.
- Manage and govern Kubernetes clusters through enterprise platforms such as Rafay Systems.
- Develop reusable Infrastructure as Code (IaC) modules using Terraform and leverage the Terraform Registry effectively.
- Implement remote state management, locking mechanisms, and infrastructure policy controls.
- Integrate secrets management solutions including HashiCorp Vault, Akeyless Vault, and Azure Key Vault with cloud and Kubernetes workloads.
- Implement secure key rotation, dynamic secrets, and certificate lifecycle management.
- Design and manage cloud networking including VPC/VNet architecture, route tables, subnets, NAT gateways, security groups, NSGs, and load balancers.
- Troubleshoot complex networking issues across AWS and Azure environments.
- Implement monitoring, alerting, and observability using Datadog, including dashboards, SLOs, and distributed tracing.
- Lead production incident management, perform root cause analysis (RCA), and implement preventive measures.
- Ensure high availability and zero-downtime deployments for internet‑facing production applications.
- Administer and troubleshoot Linux systems including Ubuntu and CentOS, with performance tuning and patch management.
- Support and enable data governance and data quality platforms such as Soda and Collibra at the infrastructure level.
- Drive automation initiatives to eliminate manual operational tasks and improve deployment efficiency.
- Implement infrastructure security best practices aligned with enterprise compliance requirements.
- Mentor junior engineers, conduct architecture reviews, and drive DevOps best practices across teams.
- Provide hands-on troubleshooting support for critical, internet‑facing production systems with strict uptime SLAs.