700+ Reliability Jobs - June 2026 - High Salaries

Showing 704 jobs results for "reliability"

Never miss any updates for Reliability jobs

Undisclosed

Singapore

  • CI/CD
  • Python
  • IaC – Terraform, Helm, Ansible, Pulumi, Bicep ...
Posted
2 days ago
Undisclosed

Singapore

  • Proficient in GPU/ML principles and cloud platforms (eg. AWS) ; Hands-on experience in GPU hardware/drivers, CUDA, NCCL, and Mellanox network operations/optimization; Data center experience preferred.
  • Familiar with cloud native container technologies and disaster recovery solutions ; Practical Docker/Kubernetes operations experience required.
  • Skilled in Linux/Shell environments; Proficient in ≥1 language ( Go/Python/Java ); Adept at leveraging automation/AI-driven methods to further enhance service stability and efficiency. ...
Posted
4 days ago
Undisclosed

Singapore

  • Experience with well-architected framework pillars (especially reliability, security, cost optimization).
  • Designing fault-tolerant and horizontally scalable systems
  • Advanced proficiency in Terraform, CloudFormation, or CDK ...
Posted
a month ago
Undisclosed

台灣

  • Assess reliability risks for liquid cooling systems (leakage, fatigue, pump life, corrosion, coolant stability).
  • Evaluate HVDC mechanical and electrical robustness (busbars, connectors, power interfaces).
  • Perform reliability prediction and life data analysis (Weibull, MTBF). ...
Posted
24 days ago
Undisclosed

Singapore

  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
  • Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions
  • Build and maintain CI/CD pipelines for the bank. ...
Posted
a month ago
Undisclosed

Singapore

  • Embody PT Lean Production System (LPS), while demonstrating a continuous improvement mindset and behaviors through the use and application of LPS tools for continuous improvement initiatives
  • Support reliability improvement strategy
  • Proactively analyze plant reliability & downtime data to identify current & potential defects working in collaboration with Maintenance & Utilities Operations, Quality, Technology & Manufacturing ...
Posted
15 days ago
Undisclosed

Singapore

  • Experience with well-architected framework pillars (especially reliability, security, cost optimization).
  • Designing fault-tolerant and horizontally scalable systems
  • Advanced proficiency in Terraform, CloudFormation, or CDK ...
Posted
a month ago
SGD9,000 - SGD9,000 Per Month

Singapore

  • Implement and enhance monitoring and observability solutions (Grafana, Datadog).
  • Manage incidents and improve resilience and recovery processes
  • Collaborate with IT, DevOps, and Cybersecurity teams to ensure infrastructure compliance and security. ...
Posted
5 days ago
Undisclosed

Singapore

  • CI/CD
  • Python
  • IaC – Terraform, Helm, Ansible, Pulumi, Bicep ...
Posted
6 days ago
Undisclosed

Singapore

  • Understanding of cloud computing concepts (AWS, Azure, or GCP)
  • Interest in DevOps, infrastructure, automation, and cloud technologies
  • Basic knowledge of monitoring, logging, and alerting systems ...
Posted
6 days ago
Undisclosed

Singapore

  • Performance management — track OEE and uptime KPIs across markets, flag deviations early, drive corrective action, and build local reporting habits so performance management becomes routine rather than reactive.
  • Currently pursuing/possesses a Bachelor's Degree in Engineering, preferably Electrical Engineering, Mechanical Engineering or Mechatronic Engineering, at a reputable university with strong academic credentials; A master’s degree is advantageous but not required, and expected to graduate by Jul 2026 and join us by Aug 2026.
  • Experience in technical engineering and program or project management, internship or full-time. ...
Posted
6 days ago
Undisclosed

KL City

  • Ensure adherence to SLAs, OLAs, and experience‑based KPIs, with a focus on reducing business impact.
  • Ensure accurate incident records, post‑incident reviews, and executive‑level reporting.
  • Own the Problem Management capability, ensuring strong discipline in root cause analysis and permanent remediation. ...
Posted
15 days ago
Undisclosed

台灣

  • Develop proactive monitoring and anomaly detection capabilities to identify issues before they impact users.
  • Deploy, manage, and optimize containerized workloads running on Kubernetes.
  • Maintain scalable cloud infrastructure across production environments. ...
Posted
11 days ago
SGD4,600 - SGD4,600 Per Month

Singapore

  • Support development and optimization of maintenance strategies, including preventive, predictive, and reliability-centered maintenance (RCM)
  • Track and report reliability KPIs (e.g., MTBF, MTTR, availability, OEE) and support continuous improvement initiatives
  • Contribute to reliability and performance improvement projects to enhance plant availability, reduce forced outages, and optimize cost ...
Posted
21 days ago
Undisclosed

Singapore

  • CI/CD
  • Python
  • IaC – Terraform, Helm, Ansible, Pulumi, Bicep ...
Posted
9 days ago
Undisclosed

Singapore

  • Strong knowledge of HTTP, DNS, and TLS protocols, with the ability to troubleshoot at the application and transport layers.
  • Familiarity with Content Delivery Networks (CDNs) and DDoS protection services.
  • Solid Linux fundamentals, including networking, system configuration, and troubleshooting. ...
Posted
16 days ago
Undisclosed

Tai Po

Posted
12 days ago
Undisclosed

Singapore

  • Contribute to post-incident reviews and continuous improvement initiatives.
  • Design, implement, and maintain monitoring dashboards.
  • Improve alert quality and reduce noise through effective threshold and metric design. ...
Posted
21 days ago
Undisclosed

Singapore

  • Support development and optimization of maintenance strategies, including preventive, predictive, and reliability-centered maintenance (RCM)
  • Track and report reliability KPIs (e.g., MTBF, MTTR, availability, OEE) and support continuous improvement initiatives
  • Contribute to reliability and performance improvement projects to enhance plant availability, reduce forced outages, and optimize cost ...
Posted
21 days ago
Undisclosed

Jurong West

  • Support development and optimization of maintenance strategies, including preventive, predictive, and reliability-centered maintenance (RCM)
  • Track and report reliability KPIs (e.g., MTBF, MTTR, availability, OEE) and support continuous improvement initiatives
  • Contribute to reliability and performance improvement projects to enhance plant availability, reduce forced outages, and optimize cost ...
Posted
21 days ago
SGD6,000 - SGD6,000 Per Month

Singapore

  • Security & Compliance: Enforce best practices for cloud security, access control, and compliance across environments.
  • Collaboration: Partner with backend, frontend, and product teams to ensure smooth deployments and reliable system operations.
  • Process & Mentorship: Improve DevOps processes, share best practices, and mentor junior engineers. ...
Posted
21 days ago
Undisclosed

Singapore

  • CI/CD
  • Python
  • IaC – Terraform, Helm, Ansible, Pulumi, Bicep ...
Posted
11 days ago
SGD7,000 - SGD7,000 Per Month

Singapore

Posted
a month ago
Undisclosed
Posted
22 days ago
SGD7,000 - SGD7,000 Per Month

Singapore

Posted
a month ago
Undisclosed

Singapore

  • Build robust incident management mechanism. Lead efforts to troubleshoot and resolve service incidents and postmortems. Coordinate with cross-functional teams to manage and mitigate service-impacting events.
  • Develop highly efficient toolchains covering end-to-end deployment and reliability assurance operations. Automate infrastructure provisioning, scaling, and management processes to reduce manual interventions and improve service quality. Develop and enhance system capabilities such as auto-failure-detection, auto-healing, chaotic engineering, and perform systematic disaster drills.
  • Engage with product and development teams to integrate reliability and performance considerations into the software lifecycle. ...
Posted
12 days ago
SGD6,500 - SGD6,500 Per Month

Singapore

  • Provide leadership and insights into root cause of device failures via in-depth failure analysis using intricate tools and state of the art equipment.
  • Collaborate with product engineering and R&D team to translate findings into solutions for new design improvements.
  • Monitor and use big data analysis on long term wafer reliability performance and propose improvements to product design or manufacturing process changes.
Posted
17 days ago
Undisclosed
  • Responsibilities:
  • · To attend to customer quality related feedback, request, reports and audit.
  • · To coordinate customer audits. ...
Posted
a month ago