jobs in D L RESOURCES PTE LTD

全职 HPC System Administrator - IT Infrastructure (High-Performance Computing Clusters) 工作, 薪水 up to SGD 5,000, D L RESOURCES PTE LTD Islandwide (Singapore) 公司招聘中 - Ricebowl

HPC System Administrator - IT Infrastructure (High-Performance Computing Clusters)

D L RESOURCES PTE LTD

SGD5,000 - SGD5,000 每月

Islandwide (Singapore)

分享
保存

工作地点

  • Islandwide (Singapore) Singapore

职位描述

岗位职责

HPC Administrator (Linux / Cluster / System Administrator) – 1–3 Years Experience

Role Overview

We are seeking a High Performance Computing (HPC) Administrator to support the deployment, monitoring, and maintenance of HPC clusters, Linux systems, and IT infrastructure. This role is ideal for candidates with experience in Linux system administration, cluster computing, and data center environments.

Key Responsibilities

  • Support daily operations of HPC clusters (compute, storage, networking)
  • Monitor system performance, job scheduling (Slurm, PBS, LSF), and resource utilization
  • Install, configure, and patch Linux/Unix systems (RHEL, CentOS, Ubuntu)
  • Manage CPU-based servers (bare metal and virtualized environments – VMware/KVM)
  • Perform system monitoring, health checks, troubleshooting, and incident management
  • Assist with user onboarding, access control (LDAP/AD), and environment configuration
  • Support cluster scaling, performance tuning, and HPC optimization
  • Work with networking (TCP/IP, DNS) and storage systems (NAS, SAN, parallel file systems)
  • Maintain technical documentation, SOPs, and runbooks

Required Skills & Experience

  • 1–3 years of experience in Linux System Administration / Infrastructure Support / HPC Operations
  • Exposure to HPC environments, cluster computing, or high-performance systems
  • Strong hands-on experience with:Linux OS (RHEL, CentOS, Ubuntu)Server management (physical servers, virtualization)
  • Understanding of:Networking fundamentals (TCP/IP, DNS, SSH)Storage technologies (NAS, SAN, distributed or parallel file systems)
  • Experience with scripting (Bash, Shell, Python)
  • Strong troubleshooting, problem-solving, and analytical skills

Preferred / Nice-to-Have Skills

  • Experience with GPU computing / GPU clusters (NVIDIA, CUDA)
  • Exposure to cloud platforms (AWS, Azure, GCP – HPC workloads)
  • Familiarity with monitoring tools (Prometheus, Grafana, Nagios, Zabbix)
  • Knowledge of DevOps tools (Ansible, Terraform – basic exposure)

重要安全守则

申请工作时,切勿提供您的银行或信用卡详细资料。不要转账或完成无关的在线调查问卷。如果您发现可疑内容,请举报此招聘广告。

了解更多