GreyOrange
Website:
greyorange.com
Job details:
We are looking for a highly skilled Staff CloudOps Engineer to build, scale, and optimise cloud infrastructure with a strong focus on reliability, automation, and performance. This role combines deep hands-on engineering with technical leadership, driving best practices across cloud platforms (AWS/GCP/Azure), DevOps, and SRE. You will work closely with engineering and product teams to ensure systems are scalable, resilient, and operationally efficient.
Cloud Infrastructure And Engineering
The candidate will have responsibilities across the following functions
:
- Design, build, and manage scalable and secure cloud infrastructure.
- Implement best practices for high availability, fault tolerance, and disaster recovery.
- Optimise cloud environments for performance and cost efficiency.
Cloud Operations And Reliability
- Drive operational excellence through automation, monitoring, and incident management.
- Define and maintain SLIs, SLOs, and SLAs.
- Troubleshoot complex production issues and ensure system reliability.
DevOps And Automation
- Build and maintain CI/CD pipelines for seamless deployments.
- Implement Infrastructure as Code (Terraform preferred, CloudFormation/ARM/Bicep).
- Automate repetitive operational tasks and improve deployment velocity.
Containers And Platform Engineering
- Work with containerisation and orchestration tools (Docker, Kubernetes - EKS/AKS/GKE).
- Support microservices architecture and platform scalability.
Security And Governance
- Implement cloud security best practices, IAM policies, and compliance standards.
- Ensure governance around cost, access, and monitoring.
Technical Leadership And Collaboration
- Mentor junior engineers and drive adoption of best practices.
- Collaborate with cross-functional teams to deliver reliable cloud solutions.
- Contribute to architectural discussions and technical decision-making.
Requirements
- Bachelor's degree in computer science, engineering, or a related field.
- 8-12 years of experience in CloudOps, DevOps, or SRE roles.
- Strong hands-on experience with AWS / Azure / GCP.
- Expertise in containers, Kubernetes, and microservices-based systems.
- Proficiency in Infrastructure as Code (Terraform preferred).
- Strong understanding of networking (VPC, DNS, load balancing) and cloud security.
- Experience with monitoring tools (CloudWatch, Prometheus, Grafana, Datadog, New Relic).
- Proven experience in automation, scalability, and reliability engineering.
Preferred Qualifications
- Cloud certifications (AWS / Azure / GCP).
- Experience with SRE practices and incident management frameworks.
- Knowledge of FinOps and cloud cost optimisation.
- Exposure to multi-cloud or hybrid cloud environments.
Key Competencies
- Strong problem-solving and debugging skills.
- Hands-on, execution-focused mindset.
- Ability to influence technical decisions and guide teams.
- Focus on automation, reliability, and continuous improvement.
This job was posted by Dhruv Parashar Talent Acquisition from GreyOrange.
Click on Apply to know more.