Head of System & Operations (GPU Infrastructure)

  • Kuala Lumpur
  • Permanent
  • Full-time
  • 1 month ago
Are you a technology leader with deep expertise in cloud operations, GPU infrastructure, and IT systems YTL AI Cloud is seeking a Head of System & Operations to lead the design, development, and management of our cutting-edge GPU cluster infrastructure powering AI and high-performance computing workloads. This is a strategic and hands-on leadership position responsible for ensuring the high availability, scalability, and performance of our GPU-as-a-Service platform. You will lead a team of engineers, work across technical functions, and play a key role in shaping the operational backbone of our AI Cloud infrastructure. Key Responsibilities Lead and manage the end-to-end operations of our GPU cluster infrastructure and platform services. Define and implement operational strategies that align with business goals for reliability, performance, and cost efficiency. Oversee complex IT systems across infrastructure, deployment, and customer delivery. Provide leadership and mentoring to a team of technology professionals. Drive continuous improvement in operational processes and technical systems. Collaborate with cross-functional teams including product, sales, and marketing to ensure smooth service integration. Maintain compliance with data protection, sustainability standards, and industry regulations. What We're Looking For Bachelor's degree in Computer Science, Engineering, or related field. 10+ years in IT/cloud operations or infrastructure leadership roles. At least 5 years in a team leadership capacity. Strong technical background in GPU and CPU cluster systems , Kubernetes, and cloud infrastructure. Hands-on problem solver with excellent analytical and troubleshooting skills. Comfortable in a dynamic, fast-paced environment with shifting priorities. Familiarity with energy-efficient data center operations is a plus. Bonus Skills Proven experience managing scalable infrastructure platforms. Exceptional communication and stakeholder management abilities. Strong documentation and process development capabilities. Vendor and SLA management experience. Why Join Us Be part of a visionary team building the next-generation AI cloud platform in Southeast Asia. Drive innovation in one of the most technically advanced environments in Malaysia. Competitive compensation and opportunity for growth in a fast-scaling AI infrastructure business.

foundit

Similar Jobs

  • Head of IT Operations

    Tune Protect

    • Kuala Lumpur
    POSITION SUMMARY: The Head of IT Operations will lead the development and implementation of IT operations strategies and roadmaps, ensuring alignment with the overall Group IT st…
    • 6 days ago
  • Head of Platform Operations

    • Petaling Jaya, Selangor
    Job Description In this role at CelcomDigi, you will be responsible for the end-to-end stability, performance, and scalability of digital platforms that support our consumer and en…
    • 18 days ago
  • Head of Operations

    • Kuala Lumpur
    JOB DESCRIPTION The Head of Operations, who reports to the Chief Operating Officer of Taylor's Schools will be responsible for leading the operations team to provide smooth, effici…
    • 8 days ago