Flag job

Report

Senior DevOps & HPC Engineer

Salary

₹40 - 65 LPA

Min Experience

5 years

Location

India

JobType

full-time

About the job

Info This job is sourced from a job board

About the role

As Senior DevOps & HPC Engineer, you will design, automate, and run our hybrid cloud/HPC infrastructure—GPU clusters, data-intensive pipelines, and CI/CD workflows that turn research code into reliable production services. You own uptime, scalability, security, and cost efficiency across the entire compute stack. Key Responsibilities Infrastructure design – stand up and evolve Kubernetes or Slurm‑backed GPU fleets across cloud and on‑prem, with IaC (Terraform/Pulumi). Automation & CI/CD – build reproducible environments (Docker, Nix, Ansible) and pipelines that move code from PR to production with zero‑downtime deployments. Observability & SRE – implement robust logging, tracing and metrics (Prometheus/Grafana, ELK), enforce SLAs ≥ 99.5 % for APIs & batch workloads. Cost & capacity management – track utilization, forecast demand, and implement auto‑scaling / spot‑instance strategies that keep compute budgets on target. Security & compliance – harden clusters, manage secrets, ensure least‑privilege IAM; contribute to policies needed for regulated environments. Must‑Have Qualifications 5+ years operating production Linux systems with GPUs at scale (cloud, HPC, or hybrid). Deep experience with Kubernetes or Slurm and container orchestration of GPU workloads. Strong Infrastructure‑as‑Code skills (Terraform, Pulumi, or similar) and CI/CD (GitHub Actions, GitLab, Jenkins, Argo). Proficiency in at least one cloud platform (AWS, GCP, Azure) including networking, IAM, and cost controls. Solid scripting/coding ability (Python, Bash, Go) for automation and custom tooling.

About the company

Grafton Biosciences is a San Francisco-based biotech startup (with offices in Bengaluru) focused on solving disease through groundbreaking innovations in early detection and therapeutics. We are combining cutting-edge synthetic biology, machine learning, and manufacturing to fundamentally extend healthy human lifespans. We're looking for passionate team members who want to shape the future.

Skills

kubernetes
slurm
terraform
pulumi
github actions
gitlab
jenkins
argo
aws
gcp
azure
python
bash
go