Website:
viridium.ai
Job details:
Company DescriptionAt Viridium.AI, We are driven by the dual opportunity to build an amazing company and make a positive impact on the world. We are building a Material Intelligence platform. Our mission is to help manufacturers swiftly and profitably identify and phase out hazardous materials, such as forever chemicals, from their products. By leveraging cutting-edge AI technology, we aim to set the standard for applying the power of AI to solve problems otherwise impossible for humans. Our AI design principles are rooted in responsible AI, adhering to the laws of physics and nature, designed to unveil insights beyond human analytical capabilities, to automate routine tasks, offering ease of walking on an escalator and hence making meeting subsequent challenges easier and more cost effective.
What this role isWe’re building cloud systems that have to work—reliably, at scale, without babysitting.
We need a hands-on DevOps / SRE who can own infrastructure end-to-end on Azure, automate aggressively, and keep production stable.
What you’ll do- Design and run Azure infrastructure (VNet, App Service, Storage, Key Vault, PostgreSQL, ACR)
- Build everything as code using Terraform
- Set up and maintain CI/CD pipelines (GitHub Actions / Azure DevOps)
- Deploy and manage apps using Docker (AKS is a plus)
- Implement zero-downtime deployments
- Own monitoring & alerts (Azure Monitor, App Insights, Grafana)
- Troubleshoot production issues and fix root causes—not patch symptoms
- Lock down systems with RBAC, Key Vault, and Azure AD
- Handle networking basics (DNS, subnets, private endpoints, firewalls)
What you should have- Strong, real-world experience with Microsoft Azure
- Solid grip on Terraform and CI/CD
- Experience running production systems (not just deploying them)
- Good understanding of containers, networking, and system design
- Comfort with Linux/Windows and web servers (Nginx/IIS)
Experience- 5–15 years in DevOps / SRE / Cloud Engineering
- You’ve owned uptime before—and know what breaks in production
What matters- You automate first
- You take ownership
- You build systems that don’t fail silently
If this sounds like you, you’ll fit right in.
Click on Apply to know more.