We are seeking a detail-oriented and technically skilled Systems Operations Engineer to join our IT team. The role is responsible for maintaining and optimizing Windows, Linux, and cloud-based systems that support critical resort operations. You’ll work across hybrid environments (on-premise and cloud), implement automation and monitoring tools, respond to incidents affecting guest services, and ensure high system availability for 24/7 resort operations. This role is key to ensuring delivery of uninterrupted, reliable systems and ICT infrastructure, contributing to IT division's goal of providing excellent IT service management.
Role Responsibilities:
System Operations & Maintenance
- Manage, monitor, and maintain Windows and Linux-based systems across hybrid environments (on-premise and cloud).
- Ensure continuous operation of critical resort systems including PMS (Property Management Systems), POS, booking engines, surveillance, and access control systems.
- Manage, monitor and maintain supported databases across hybrid environment (on-premise and cloud).
Cloud & Hybrid Infrastructure
- Operate and maintain cloud services (AWS, Azure) integrated with on-premise data centers.
- Plan, coordinate and support disaster recovery and high-availability configurations for critical hospitality and gaming systems.
Monitoring & Reliability
- Implement proactive monitoring using tools like SolarWinds, Datadog or Azure Monitor to prevent downtime in high-traffic systems.
- Quickly respond to incidents affecting guest services or operations and conduct thorough root cause analysis.
- Automation & Configuration Management using tools like Ansible, Terraform, or PowerShell to automate provisioning and configuration.
- Maintain configuration baselines for servers supporting hotel, casino, and resort services.
- Design and develop RWS observability platform comprising of monitoring, metrics, and logging systems.
- Conceptualize and implement early anomaly detection (reduction of mean-time issue identification), pattern analysis, self-healing, infrastructure resizing, noise reduction and outage prediction.
- Develop visualizations in Kibana/Grafana or equivalent to provide a single pane view for end user experience, application, infrastructure & security.
- Collaborate with the Application and IT Business partner teams to develop metrics measuring the performance against initiatives and report on those to stakeholders.
Security & Compliance
- Enforce security best practices across all systems in compliance with industry standards such as PCI-DSS (for gaming/payments) and GDPR (for guest data) including reviewing of hardening standards.
- Manage patching and endpoint security to protect infrastructure.
Collaboration & Documentation
- Work closely with IT, facilities, and service departments to align infrastructure with business needs.
- Maintain detailed documentation of systems, procedures, and service records.
Role Requirements:
Required Qualifications
- 8+ years’ experience in systems administration or SysOps, preferably in a 24/7 operational environment (hospitality, casino, or similar)
- Knowledge of compliance requirements: PCI-DSS, GDPR, ISO/IEC 27001
- Certifications: AWS SysOps Administrator, Microsoft Certified: Azure Administrator, CompTIA Security+
Skills
- Scripting experience in PowerShell, Bash, or Python
- Strong attention to detail and commitment to uptime
- Ability to work calmly under pressure and during major events or peak resort hours
- Clear written and verbal communication for working with technical and non-technical teams
- Flexible schedule availability (some weekend/evening/on-call rotation may be required)