ZyBiSys
Website: zybisys.com
Job details:
Role Overview
We are seeking an experienced NOC Manager to lead Network Operations Center (NOC) activities and ensure high availability, performance, and reliability of infrastructure and services. The role involves overseeing enterprise infrastructure monitoring, managing operational teams, and ensuring compliance with Service Level Agreements (SLAs).
The ideal candidate will have strong experience in data center operations, incident management, infrastructure monitoring, and team leadership, along with the ability to handle enterprise customer environments and critical escalations. Experience supporting FinTech platforms and high-availability financial transaction systems is an added advantage.
Key Responsibilities:
NOC Operations Management
• Lead and manage 24x7 NOC operations to ensure continuous monitoring of infrastructure, networks, and systems
• Oversee monitoring of servers, network devices, storage systems, and applications
• Ensure rapid identification, troubleshooting, and resolution of operational incidents
• Maintain monitoring of critical platforms including FinTech and financial transaction systems
Team Leadership
• Manage and mentor NOC engineers and shift leads
• Define shift structures, operational processes, and escalation procedures
• Conduct training sessions and performance reviews
• Foster a culture of operational excellence and proactive monitoring
Incident & Escalation Management
• Act as the primary escalation point for critical incidents and outages
• Ensure timely resolution and minimize service disruptions
• Lead Root Cause Analysis (RCA) for major incidents
• Implement corrective and preventive actions
Data Center & Infrastructure Oversight
• Monitor and manage data center infrastructure including servers, networking equipment, storage, and virtualization platforms
• Coordinate with infrastructure, security, and engineering teams
• Ensure stability of high-availability environments
SLA & Service Delivery Management
• Ensure adherence to SLAs and operational KPIs
• Track performance metrics and drive continuous improvement
• Handle enterprise escalations and ensure effective communication
Enterprise Customer Support
• Support enterprise customers and large-scale infrastructure environments
• Provide operational updates during major incidents
• Ensure monitoring of mission-critical services
Automation & Process Improvement
• Identify automation opportunities in monitoring and incident response
• Reduce MTTD and MTTR (mean time to detect and mean time to resolve incidents)
• Drive continuous improvement initiatives
Reporting & Compliance
• Prepare operational, incident, and SLA reports
• Ensure compliance with internal policies and standards
• Provide insights on performance and availability
Required Skills & Qualifications:
• 15+ years of IT experience, including 8+ years in NOC/infrastructure operations
• Experience managing 24x7 NOC teams
• Strong understanding of data center infrastructure and monitoring tools
• Expertise in incident, problem, and escalation management
• Networking knowledge: BGP, MPLS, VXLAN, EVPN, SD-WAN
• Experience with tools such as SolarWinds, Nagios, Zabbix, PRTG
• Exposure to cloud platforms (AWS, Azure, Google Cloud)
• Strong leadership, communication, and decision-making skills
• Experience handling enterprise and critical environments
Preferred Qualifications:
• Bachelor’s degree in IT / Computer Science / Engineering
• Certifications such as ITIL, CCNA/CCNP, CCIE, RHCE
• Experience in FinTech or high-availability transaction environments
About Us:
At Zybisys, we build and manage robust, secure, and scalable technology infrastructure for modern enterprises. From data center operations to cybersecurity, we enable businesses to run mission-critical systems with high availability and performance, backed by a strong commitment to operational excellence.