Summary
Overview
Work History
Education
Certification
Safety Training
Timeline
Generic

Muhammad Shahid Iqbal

Jeddah

Summary

Data Center Manager and Specialist with 18+ years of proven leadership in designing, delivering, and operating large-scale, mission-critical data center environments exceeding 4,000 sqm, supporting advanced HPC and AI platforms. Extensive experience managing high-density infrastructure, including next-generation CPU and GPU clusters, rack-scale AI systems, and hybrid cooling technologies such as direct-to-chip liquid cooling and advanced air-cooling solutions. Recognized for leading complex, high-performance environments powered by cutting-edge technologies, including HPE HPC systems, NVIDIA GPU architectures, and integrated AI platforms. Expert in overseeing critical power and mechanical infrastructure, including 16MW DRUPS systems, UPS solutions, substations, and large-scale chiller plants, ensuring maximum uptime, resilience, and operational efficiency. Proven track record in end-to-end data center project delivery—from concept and technical design through commissioning—supported by deep expertise in RFP development, SOW definition, and engineering documentation. Strong command of CAPEX/OPEX planning, financial optimization, and strategic infrastructure scalability aligned with long-term business objectives. Skilled in vendor and OEM management, SLA governance, and contract compliance, with a focus on building high-performing partnerships. Experienced in implementing CMMS and CAFM systems to enhance asset lifecycle management and maintenance strategies across critical infrastructure. A results-driven leader of 24/7 operations teams, driving high availability through advanced monitoring platforms and operational excellence frameworks. Adept at establishing governance models, SOPs, and compliance standards aligned with ISO and ITIL best practices. Committed to continuous improvement through risk management, audit programs, and workforce development, while ensuring robust Business Continuity and Disaster Recovery (BCP/DR) strategies for uninterrupted operations.

Overview

18
18
years of professional experience
8
8
Certifications

Work History

Data Center Specialist

KAUST (King Abdullah University of Science & Technology)
05.2010 - Current
  • Lead the strategic planning, design, and execution of large-scale, mission-critical data center environments exceeding 4,000 sqm of raised floor space, supporting advanced HPC platforms such as SHAHEEN-III.
  • Oversee high-density HPC and AI infrastructure, including 18 HPE CPU cabinets (4,608 nodes; AMD Genoa architecture) with direct-to-chip liquid cooling (Motivair CDUs), 7 HPE GPU Cabinets (704 nodes; NVIDIA GH200) with liquid cooling.
  • Multi-generation HPC environments (Cascade Lake, Rome, Sapphire Rapids, Skylake) and diverse GPU platforms (Tesla, Quadro, RTX series) supported by precision air-cooling systems (APC ACRC).
  • Advanced AI platforms including HPE XD690 and NVIDIA GB200 NVL72 rack-scale systems integrating Grace CPUs and Blackwell GPUs.
  • Direct the operation and optimization of critical power infrastructure, including 16MW DRUPS systems and Schneider Galaxy VL UPS solutions, ensuring maximum uptime and resilience for AI and enterprise workloads.
  • Own end-to-end data center project delivery, from concept and design through commissioning, including development of RFPs, SOWs, technical specifications, and engineering documentation.
  • Manage and optimize CAPEX and OPEX budgets, aligning financial planning with long-term infrastructure strategy, scalability, and operational efficiency.
  • Establish and enforce vendor management frameworks, including SLA governance, performance monitoring, and contract compliance, while maintaining strong partnerships with OEMs and service providers.
  • Lead preventive and corrective maintenance strategies using CMMS and CAFM platforms, improving asset lifecycle management, operational efficiency, and reporting accuracy. Drive large-scale infrastructure programs across. Power Systems, MV switchgear, substations, UPS, MCCs, RMUs, DRUPS, and generators. Cooling Systems, AHUs, DX units, in-row cooling, thermal storage, and liquid cooling solutions.Water & Fire Systems, Potable water infrastructure and fire suppression systems
  • Lead and develop 24/7 operations teams, ensuring high availability through proactive monitoring using platforms such as Schneider DCE, EcoStruxure IT Advisor, TTK leak detection, and Traka systems.
  • Establish governance frameworks including policies, SOPs, and operational standards, ensuring compliance with industry best practices, regulatory requirements, ISO, and ITIL standards.
  • Drive workforce capability development through structured training programs, OEM certifications, and continuous on-the-job coaching across operations, maintenance, and engineering domains.
  • Oversee campus-wide network infrastructure upgrades, including structured cabling, patching strategy, equipment modernization, and data center integrations.
  • Lead continuous improvement initiatives by conducting audits, risk assessments, and performance reviews, implementing change management processes to enhance reliability and efficiency.
  • Develop and manage Data Center Business Continuity and Disaster Recovery (BCP/DR) strategies, including risk assessments, redundancy planning, failover testing, and crisis response coordination to ensure uninterrupted operations under all scenarios

Data Warehouse Trainee

Digital Processing System, Pakistan-Islamabad
11.2008 - 11.2009
  • ETL development using Microsoft SQL Server integration services.
  • Development of a dashboard using Crystal Xcelsius.
  • SQL Server integration services (SSIS), Crystal Xcelsius.

Data Center Support Assistant

Interactive Group of Companies, Pakistan-Islamabad
05.2008 - 09.2008
  • Data Center Department:
  • Erecting and operations management of a Tier 4 Data Center with 20 Rack capacity with 24000 BTU/hr.
  • HVAC, Redundant UPS Arrays of 120 KVA each, Redundant Generator Power Supply of 250 KVA each.
  • Dual Control Panel Fire Suppression Systems, Double Layer Biometric Security, and 24/7 Online Monitoring.
  • Systems. The D.C. is delivering services on SDN methodology to the mentioned systems and other clients.

Education

Master’s - Information Technology

Bahauddin Zakariya University (BZU)
Pakistan
01-2007

Bachelor of Science - Computer Science

Bahauddin Zakariya University (BZU)
Pakistan
01-2004

Certification

PMI:

Safety Training

  • Manual Handling for low-risk environments - Citation Approved Health & safety (Approved by ROSPA E-Learning Course) Ladders and Stepladders - Citation Approved Health & safety.
  • Managing Contractors - Citation Approved Health & Safety.
  • ISO 45001 Awareness - Citation Approved Health & Safety.
  • ISO 9001 Awareness - Citation Approved Health & Safety.
  • Lone working - Citation Approved Health & Safety.
  • Cisco Network Administration Training at CORVIT Training Institute, Islamabad.
  • VESDA (Very Early Smoke Detection Alarm System) Training at KAUST KSA.
  • Successfully completed the Effective Communication course in March 2017 at KAUST, KSA.
  • Successfully completed Customer Centricity & Focus course in February 2017 at KAUST, KSA.
  • Data Center design and Audit Seminar presented by Capitoline and supported by IET & KAUST.
  • KAUST Global IT Summit 2017.

Timeline

Data Center Specialist

KAUST (King Abdullah University of Science & Technology)
05.2010 - Current

Data Warehouse Trainee

Digital Processing System, Pakistan-Islamabad
11.2008 - 11.2009

Data Center Support Assistant

Interactive Group of Companies, Pakistan-Islamabad
05.2008 - 09.2008

Bachelor of Science - Computer Science

Bahauddin Zakariya University (BZU)

Master’s - Information Technology

Bahauddin Zakariya University (BZU)
Muhammad Shahid Iqbal