Full-time DC Operation Engineer job, salary, Vouch Recruitment hiring - Ricebowl

DC Operation Engineer

Vouch Recruitment

Undisclosed

Singapore

Work Location

  • Singapore

Job Description

Job Responsibilities

Our client is a leading technology and digital infrastructure provider focused on delivering next-generation AI and High-Performance Computing (HPC) solutions through advanced GPU-powered data centre platforms.


With significant investments in cutting-edge data centre technologies, the organization is at the forefront of supporting AI innovation, machine learning workloads, and large-scale compute-intensive applications. Their environment combines modern data centre operations, liquid cooling technologies, and mission-critical infrastructure to power enterprise and AI-driven workloads.

This opportunity offers hands-on exposure to state-of-the-art GPU data centre operations and facilities management, providing a unique platform for individuals looking to build expertise in AI infrastructure, HPC environments, and the future of data centre technology. Candidates will work alongside experienced professionals while contributing to the development and operation of advanced AI-ready infrastructure.


Primary Responsibilities


Data Centre Operations Management

  • Monitor, respond to, and escalate incidents in accordance with defined service levels, operational impact, and criticality.
  • Perform hands-on operation, monitoring, and maintenance of data centre electrical, air-cooled, and liquid-cooled infrastructure.
  • Support the continuous improvement of operational processes, procedures, and service reliability within a GPU-focused data centre environment.
  • Coordinate visitor and vendor access, including security clearance requirements for entry into the GPU-as-a-Service (GPUaaS) facility.
  • Ensure vendors comply with workplace safety and health (WSH) requirements and site regulations while performing work within the data centre.
  • Participate in after-hours support activities, including weekends, public holidays, and emergency response situations when required.


Data Centre Facilities Management

  • Monitor and maintain critical data centre infrastructure, including power systems, cooling systems, environmental controls, leakage detection systems, and related facilities.
  • Maintain accurate operational records and technical documentation, and generate reports on data centre performance and facility health.
  • Collaborate with internal stakeholders, engineering teams, and vendors to resolve operational, technical, and process-related issues.
  • Ensure strict adherence to Standard Operating Procedures (SOPs), Methods of Procedure (MOPs), and Emergency Response Procedures (ERPs) for mission-critical operations.
  • Provide expertise and operational support for both air-cooled and liquid-cooled server environments, contributing to capacity planning and infrastructure optimization.
  • Coordinate maintenance activities, system shutdowns, and change management processes to ensure maximum uptime and operational reliability.
  • Prepare and present monthly facility performance and health status reports.
  • Identify, assess, and mitigate operational, safety, and environmental risks within the data centre environment.
  • Conduct routine inspections of server infrastructure, cooling distribution units, and associated equipment.
  • Perform first-level troubleshooting of server-related issues in collaboration with remote engineering and technical support teams.


What We Are Looking For

  • Diploma in Mechanical Engineering, Electrical Engineering, Building Services, or a related technical discipline.
  • Strong understanding of data centre infrastructure, including electrical and mechanical systems, cooling technologies, fire protection systems, building management systems (BMS), and mission-critical facility operations.
  • Experience supporting and maintaining data centre equipment, particularly power, cooling, and facility infrastructure.
  • Familiarity with modern data centre technologies, including GPU-oriented environments and liquid cooling solutions, is an advantage.
  • Ability to work independently while contributing effectively within a team-oriented environment.
  • Strong organizational skills with the flexibility to adapt to changing operational priorities and schedules.
  • Excellent problem-solving and stakeholder coordination capabilities.
  • A proactive, hands-on attitude with a strong willingness to learn and develop expertise in next-generation GPU and AI infrastructure technologies.
  • Willingness to support a 24/7 operational environment, including after-hours, weekend, and public holiday coverage when required.


Click "Apply now" to find out more about this opportunity and other available positions.


EA License: 22C1396

EA Personnel: R1551466

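Important Safety Guidelines

When applying for a job, never provide your bank or credit card details. Do not transfer money or complete unrelated online surveys. If you notice anything suspicious, please report this job ad.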