XaasIO
Website: xaasio.com
Job details:
- Job Title: OpenStack Architect
- Primary Location: Coimbatore, Tamil Nadu
- Work Mode: On-site / Hybrid
- Company: XaasIO Systems Private Limited
- Role Type: Full-time
- Experience: 4 to 10 years preferred
Built Different: Meet XaasIO
XaasIO is looking for an experienced OpenStack Architect to design, build, operate, and modernize enterprise-grade private cloud, sovereign cloud, AI infrastructure, and hybrid cloud platforms using OpenStack, CEPH, Linux, Kubernetes, SDN, automation, backup, replication, DevSecOps, and observability tooling.
The OpenStack Architect will act as the front-end technical interface with customers, partners, OEMs, and internal delivery teams. The role requires strong hands-on OpenStack experience, excellent communication skills, strong problem-solving ability, and the capability to lead Day-0 workshops, drive Day-1 implementation through structured quality checks and checklist-based execution, and define Day-2 operations parameters.
This is a senior architecture and delivery role for candidates who can combine deep technical skills with customer-facing leadership.
Why This Role Matters
- The OpenStack Architect will be responsible for:
- Designing enterprise-grade OpenStack private cloud architectures for production, regulated, and service provider environments.
- Leading OpenStack deployment, upgrade, expansion, migration, and day-2 operations projects.
- Acting as the front-end technical interface with customers for architecture discussions, workshops, reviews, status updates, escalations, and solution validation.
- Participating in and leading Day-0 discovery and design workshops to capture customer requirements, existing infrastructure details, business goals, security requirements, compliance requirements, operational expectations, and the cloud roadmap.
- Converting Day-0 workshop inputs into:
- High-Level Design documents
- Low-Level Design documents
- Bill of Materials
- Implementation plans
- Migration plans
- Integration plans
- Risk registers
- Assumptions and dependencies
- Acceptance criteria
- Operational handover documents
- Architecting OpenStack services including:
- Nova
- Neutron
- Cinder
- Glance
- Keystone
- Horizon
- Heat
- Octavia
- Barbican
- Placement
- Ironic, where applicable
- Designing compute, storage, network, controller, and high-availability architecture for OpenStack clouds.
- Working with the KVM/QEMU/libvirt virtualization stack for compute design, tuning, troubleshooting, and performance optimization.
- Designing and integrating CEPH-based storage for OpenStack, including:
- RBD for Nova and Cinder
- RGW/S3 object storage
- CephFS, where applicable
- Storage pool design
- Replication, erasure coding, CRUSH rules, and failure domains
- Designing OpenStack networking using Neutron, Open vSwitch, OVN, VLAN, VXLAN, provider networks, tenant networks, floating IPs, routers, security groups, BGP, and load balancing.
- Working with physical network and security teams to integrate OpenStack with firewalls, load balancers, routers, ToR switches, IPAM, DNS, NTP, LDAP, SSO, IAM, SIEM, and ITSM platforms.
- Designing OpenStack high availability using clustered controllers, Galera/MariaDB, RabbitMQ, HAProxy, Keepalived, Memcached, and service monitoring.
- Planning and executing OpenStack upgrades, patching, version transitions, rollback strategies, and production change windows.
- Defining Day-2 operational parameters, including:
- SLA and support model
- Incident management process
- Change management process
- Backup and restore policy
- Replication and DR policy
- Monitoring and alerting thresholds
- Capacity management process
- Patch management process
- Security and compliance checks
- Escalation matrix
- Reporting and governance model
- Leading Day-1 implementation planning and execution using structured checklists, test cases, QA gates, validation reports, and operational handover templates.
- Ensuring Day-1 activities are executed through a checklist-based and auditable process, including:
- Pre-requisite validation
- Network readiness checks
- Storage readiness checks
- Compute readiness checks
- Security baseline validation
- OpenStack service validation
- CEPH integration validation
- Backup and DR validation
- Monitoring and logging validation
- Customer acceptance testing
- Designing backup, replication, and disaster recovery strategies for OpenStack workloads, control plane services, databases, images, volumes, and tenant environments.
- Integrating backup and replication platforms through APIs for workload protection, recovery orchestration, failover, failback, and DR testing.
- Building automation for cloud deployment and lifecycle operations using Ansible, OpenTofu/Terraform, scripts, and CI/CD pipelines.
- Supporting DevSecOps and CI/CD pipeline integration for infrastructure validation, security scanning, image validation, IaC scanning, compliance gates, and deployment automation.
- Creating design documents, HLDs, LLDs, BoMs, implementation plans, migration plans, test cases, runbooks, SOPs, QA checklists, and operational handover documents.
- Identifying architecture gaps, design risks, operational pitfalls, loopholes, and improvement areas before production rollout.
- Troubleshooting complex production issues related to OpenStack, CEPH, networking, KVM, storage, APIs, automation, performance, and integrations.
- Mentoring engineers and platform teams on OpenStack architecture, operations, troubleshooting, automation, and best practices.
Skills You Bring to the Table
- Bachelor’s degree in Computer Science, Information Technology, Engineering, or equivalent practical experience.
- Certifications in OpenStack, Linux, Kubernetes, Red Hat, Canonical, SUSE, CEPH, cloud platforms, or security will be an added advantage.
- The candidate should have strong hands-on experience in:
- OpenStack architecture, deployment, and operations
- Linux administration, preferably Ubuntu, RHEL, Rocky Linux, or AlmaLinux
- KVM/QEMU/libvirt virtualization
- OpenStack compute, storage, network, identity, image, and orchestration services
- OpenStack Neutron networking
- Open vSwitch / OVN
- CEPH storage integration with OpenStack
- RabbitMQ, MariaDB/Galera, HAProxy, Keepalived, Memcached
- OpenStack troubleshooting and log analysis
- Private cloud capacity planning and sizing
- High availability and disaster recovery design
- Backup, restore, replication, and recovery workflows
- Automation using Ansible
- Shell scripting and Python scripting
- REST API usage and infrastructure integration
- Security hardening and role-based access control
- Customer-facing technical communication
- Day-0 workshop participation and leadership
- Day-1 implementation planning and checklist-based execution
- Day-2 operations planning, SLA definition, and governance model design
- Technical documentation, architecture diagrams, and operational runbooks
- OpenStack Platform Exposure
- The candidate should have exposure to one or more OpenStack deployment models or distributions:
- Upstream OpenStack
- Kolla Ansible
- OpenStack Charms
- Red Hat OpenStack Platform / Red Hat OpenStack Services on OpenShift
- Mirantis OpenStack
- Canonical OpenStack
- OpenStack-Ansible
- TripleO, where applicable
- Vendor-neutral OpenStack operations and migration environments
- Cloud and Infrastructure Exposure
- The candidate should have experience or working knowledge in:
- Enterprise private cloud platforms
- Sovereign cloud or regulated cloud environments
- VMware to OpenStack migration
- Public cloud operations such as AWS, Azure, or GCP
- Kubernetes and container platforms
- Bare-metal provisioning
- Software-defined storage
- Software-defined networking
- NFV, firewalls, VPNs, load balancers, and routing
- IPAM, DNS, NTP, LDAP, SSO, and IAM integrations
- Monitoring, logging, alerting, SIEM, and ITSM integrations
- Backup platforms and backup API integrations
- Replication and disaster recovery platforms
- CI/CD and DevSecOps pipelines
Skills You Bring to the Table
- The following skills will be an added advantage:
- Kubernetes architecture and operations
- OpenShift or Rancher exposure
- CEPH deep operations and troubleshooting
- OpenStack upgrade experience across major releases
- OpenStack performance tuning and capacity optimization
- OpenStack billing, metering, and chargeback concepts
- CloudKitty, Gnocchi, Ceilometer, or telemetry stack exposure
- Backup tools such as Velero, Restic, Veeam, Bacula, Bareos, Commvault, Rubrik, Cohesity, or similar
- Replication and DR platforms for VM, storage, database, or Kubernetes workloads
- DevSecOps pipelines including SAST, DAST, SCA, container scanning, IaC scanning, and compliance gates
- CI/CD tools such as GitHub Actions, GitLab CI/CD, Jenkins, Argo CD, Tekton, or similar
- OpenTofu or Terraform
- GitOps practices
- Ansible Automation Platform / AWX
- Observability tools such as Grafana, Prometheus, OpenSearch, Zabbix, Wazuh, or Alertmanager
- Security and compliance tools such as OpenSCAP, Trivy, Lynis, CIS-CAT, or similar
- Active GitHub profile, open-source contributions, technical blogs, architecture diagrams, or community participation
Extra Cool If You Know
- Cloud Platform: OpenStack
- Virtualization: KVM, QEMU, libvirt
- Storage: CEPH, RBD, RGW, CephFS
- Networking: Neutron, Open vSwitch, OVN, VLAN, VXLAN, BGP, routing, firewalls
- Operating Systems: Ubuntu, RHEL, Rocky Linux, AlmaLinux
- Automation: Ansible, OpenTofu, Terraform, Python, Shell
- Database / Messaging: MariaDB/Galera, RabbitMQ, Memcached
- Load Balancing / HA: HAProxy, Keepalived, Octavia
- Containers: Kubernetes, OpenShift, Rancher
- Monitoring: Grafana, Prometheus, Zabbix, OpenSearch, Alertmanager
- Security: Keystone RBAC, Barbican, OpenSCAP, Wazuh, Trivy
- CI/CD: GitHub Actions, GitLab CI/CD, Jenkins, Argo CD, Tekton
- DevSecOps: Trivy, SonarQube, Semgrep, OWASP ZAP, Checkov, OpenSCAP
- Backup / DR: Velero, Restic, Veeam, Bacula, Bareos, or similar
- The candidate should have:
- Strong customer-facing communication skills
- Strong written and verbal communication skills
- Strong problem-solving and analytical thinking
- Ability to explain complex cloud architecture in simple business and technical language
- Ability to lead workshops, technical discussions, and architecture reviews
- Ability to handle customer escalations professionally
- Ability to work with cross-functional teams
- Ability to document decisions, risks, dependencies, and action items clearly
- Ownership mindset for delivery, quality, and customer success
- Ability to mentor junior engineers and guide implementation teams
How You’ll Make an Impact
- We are looking for someone who:
- Has deep hands-on experience with production OpenStack environments.
- Can design and troubleshoot large-scale private cloud platforms.
- Can act as the front-end technical interface with customers.
- Has strong communication, documentation, and problem-solving skills.
- Can lead Day-0 workshops and convert requirements into HLDs, LLDs, BoMs, implementation plans, checklists, and operating models.
- Can define the Day-2 SLA, operations model, escalation process, monitoring parameters, backup policy, DR policy, and governance model.
- Can lead Day-1 execution using structured QA gates, checklists, test cases, and acceptance criteria.
- Understands compute, network, storage, security, backup, and DR as one integrated platform.
- Can lead customer-facing architecture discussions and technical workshops.
- Can mentor implementation and operations teams.
- Is comfortable with open-source technologies and vendor-neutral cloud architecture.
- Understands enterprise change control, compliance, security, and production operations.
- Can work with distributed teams, customer delivery teams, OEMs, and partner organizations.
Perks, Culture & Growth
At XaasIO Systems Pvt. Ltd., we believe our employees are our greatest asset. We are committed to creating a workplace that fosters innovation, growth, and well-being.
- Learning & Growth
- Opportunities to work on cutting-edge technologies
- Continuous learning through training, certifications, and mentorship
- Exposure to real-time projects and global clients
- Work Culture
- Open, inclusive, and collaborative work environment
- Encouragement of new ideas and innovation
- Strong focus on teamwork and transparency
- Career Development
- Clear career progression paths
- Performance-driven growth opportunities
- Internal mobility across roles and projects
- Work-Life Balance
- Flexible work environment (WFH / hybrid)
- Paid time off and leave benefits
- Supportive policies for employee well-being
- Rewards & Recognition
- Competitive compensation and benefits
- Performance-based incentives
- Employee recognition programs
- Safe & Respectful Workplace
- Strong adherence to policies aligned with the Sexual Harassment of Women at Workplace (Prevention, Prohibition and Redressal) Act, 2013
- Zero tolerance for harassment or discrimination
Summary
This is a senior architecture and customer-facing delivery role based primarily in Coimbatore for engineers who want to build hyperscaler-like private cloud, sovereign cloud, AI infrastructure, and hybrid cloud platforms using OpenStack, CEPH, Kubernetes, SDN, automation, backup, replication, DevSecOps, and open-source technologies.
The role is ideal for architects who can design, deploy, automate, troubleshoot, document, and operate large-scale OpenStack environments while leading customer workshops, defining Day-2 operating models, and ensuring Day-1 execution through structured QA and checklist-based delivery.