Flag job

Report

DevOps Engineer

Min Experience

5 years

Location

New Delhi

JobType

full-time

About the job

Info This job is sourced from a job board

About the role

Yotta Data Services is looking for Site Reliability Engineers to manage mission-critical cloud infrastructure for customers globally. This role involves ensuring the smooth continuity of multiple production environments by performing necessary actions to maintain and enhance them. Responsibilities Monitor production environments, ensuring availability and system health. Provide predictive insights to optimize and safeguard against future abnormalities. Build and manage platform infrastructure and applications. Improve reliability, quality, and time-to-market of cloud and on-prem software solutions. Optimize system performance while anticipating customer needs. Provide operational support and engineering for large-scale distributed infrastructure and applications. Requirements Experience: 5+ years in managing large-scale infrastructure and cloud systems. Ability to gather and analyze metrics for performance tuning and fault finding. Partner with development teams to improve services through testing and release procedures. Experience in system design consulting, platform management, and capacity planning. Expertise in automation and maintaining services through automation. Technical Skills Expertise in Terraform or Ansible. Strong knowledge of Linux, MySQL, and scripting using Bash and Python. Experience in maintaining on-prem cloud solutions like OpenStack, CloudStack, OpenNebula, etc. In-depth experience with Kubernetes and container orchestration. Experience with monitoring systems like Prometheus, Nagios, Zabbix, etc. Hands-on experience in maintaining high availability systems and ensuring business continuity. Bonus Attributes Knowledge of CloudStack/Citrix CloudPlatform. Data center or ISP experience. Knowledge of GPU-based systems, Nvidia BCM, GPU Virtualisation. Experience in supporting AI/ML workloads.

Skills

terraform
ansible
linux
mysql
bash
python
openstack
cloudstack
opennebula
kubernetes
prometheus
nagios
zabbix