700+ Reliability Jobs - June 2026 - High Salaries

Showing 737 job results for "reliability"

Undisclosed

KL City

  • Collaboration: Work with development squads to ensure new features are designed with reliability in mind; participate in Agile ceremonies
  • Incident Management: Conduct root cause analysis for incidents and implement corrective actions to prevent recurrence; participate in on-call rotations for critical systems
  • Continuous Improvement: Drive initiatives to improve system performance, reliability, and scalability through best practices. ...
Posted
18 days ago
SGD7,000 - SGD9,000 Per Month

Singapore

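  • Handle major and routine incidents and restore service; analyze the root cause of each incident and drive improvements and optimizations.
  • Develop and maintain automated operations tools to improve operational efficiency and streamline operations workflows.
  • Provide 7×24 on-call technical support, with 5×8 working-hours service. ...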
Posted
18 days ago

Truewatch Technology Inc. Pte Ltd

SGD7,000 - SGD10,000 Per Month

Singapore

  • Manage and optimize cloud infrastructure on AWS, Azure, or GCP to improve resource utilization and cost efficiency
  • Implement Infrastructure as Code (IaC) and automate deployments through CI/CD pipelines to accelerate delivery and reduce errors
  • Enhance system scalability, resilience, and operational efficiency by identifying and applying improvements ...
Posted
18 days ago
Undisclosed
  • Collaborate with cross-functional teams to integrate tribological insights into product development and optimization
  • Present findings and recommendations to diverse audiences, including management and global teams
  • Contribute to the development of new testing methods and equipment to enhance tribological analysis capabilities ...
Posted
18 days ago
Undisclosed

Singapore

  • CI/CD golden path - Codify Cloud Build pipelines and automated canary rollouts for Cloud Functions / Cloud Run.
  • Infrastructure as Code - Manage GCP resources; embed security, IAM least-privilege, and cost controls by default.
  • Performance & cost tuning - Profile hot paths (BigQuery, Firestore, Pub/Sub), and implement caching or concurrency improvements to keep user latency ...
Posted
19 days ago
Undisclosed

Singapore

  • Unplanned downtime reduction metrics
  • Private Health Insurance
  • Training & Development ...
Posted
19 days ago
Undisclosed

Singapore

  • Monitor system performance, troubleshoot issues, and ensure optimal operation
  • Partner with development teams to improve system reliability and performance at the code and architecture level
  • Develop and implement automation tools to streamline operations and reduce toil ...
Posted
19 days ago
Undisclosed
  • Review the Preventive Maintenance plan so that it stays dynamic and suited to the actual condition of the plant.
  • Define standards and procedures.
  • Coordinate the Maintenance Program across internal and external teams. ...
Posted
19 days ago
MYR19,000 - MYR19,000 Per Month

KL City

  • Conduct thorough post-mortem analyses following incidents, driving continuous improvement through root cause identification and solution implementation.
  • Collaborate with development and operations teams to establish best practices in system reliability and incident management.
  • Troubleshoot and resolve issues related to database performance, network connectivity, and deployment failures, including diagnosing problems at the underlying platform level (e.g., Kubernetes, virtual machines). ...
Posted
19 days ago
Undisclosed

Singapore

Posted
19 days ago
Undisclosed

KL City

  • Implement monitoring, alerting, SLIs, SLOs, and SLA tracking.
  • Participate in 24/7 on-call rotations and incident response activities.
  • Conduct root cause analysis and support post-mortem reviews. ...
Posted
19 days ago
Undisclosed

Singapore

  • Build and maintain production tooling that supports deployment, orchestration, monitoring, and system diagnostics
  • Define and maintain observability, SLI/SLOs, and performance metrics in partnership with product owners
  • Leverage metrics and capacity planning to ensure scalability and uptime ...
Posted
19 days ago
Undisclosed
WFH

Singapore

  • Apply SRE principles to Customer Success - enabling customers and team members to monitor and proactively assist our most important customers.
  • Detect issues commonly occurring in the platform, either underlying or immediate, and work with teams to ensure their priority is recognised.
  • Proactively find improvements in the platform and methods of implementation that can unblock them ...
Posted
19 days ago

Actiforce Mechatronics Technology (M) Sdn Bhd

MYR1,700 - MYR2,500 Per Month
  • Electrical safety checks
  • Noise and vibration measurements
  • Operate testing equipment such as load cells, data loggers, multimeters, force gauges, sound level meters ...
Posted
19 days ago
Undisclosed

Singapore

  • Programming: High proficiency in Python and Java for developing network management platforms and automation scripts.
  • Observability Tools: Hands-on experience with Grafana, Elasticsearch,
  • CI/CD: Experience building automated pipelines (Jenkins, Bitbucket, Jira) for validating network changes before production deployment.
Posted
13 days ago
Undisclosed

Singapore

Posted
13 days ago
Undisclosed

Singapore

  • Work with passionate teammates who value innovation, collaboration, and customer success.
  • Grow your career in a culture that champions continuous learning and fast career development.
  • Market-competitive compensation, global exposure, and a vibrant, creativity-fueled work atmosphere. ...
Posted
13 days ago

APPLIED MATERIALS SOUTH EAST ASIA PTE. LTD.

SGD3,500 - SGD3,500 Per Month

Singapore

  • Provide basic maintenance & care to tools and machines.
  • Maintenance of the workplace to meet 5S and safety requirements.
  • Support setups from established procedures where applicable. ...
Posted
20 days ago
Undisclosed

KL City

  • Manage cloud infrastructure provisioning and configuration using IaC tooling (Terraform, Helm), supporting both AWS/Azure cloud deployments and on-premises customer environments.
  • Implement and maintain CI/CD pipelines for GFS solutions (Jenkins, etc.)
  • Work with Engineering teams to ensure security and compliance readiness for Managed services — including PCI DSS, ISO 27001, SOC 1/2/3, PDPA/GDPR — in close coordination with InfoSec teams. ...
Posted
13 days ago
Undisclosed

KL City

  • Manage cloud infrastructure provisioning and configuration using IaC tooling (Terraform, Helm), supporting both AWS/Azure cloud deployments and on-premises customer environments.
  • Implement and maintain CI/CD pipelines for GFS solutions (Jenkins, etc.)
  • Work with Engineering teams to ensure security and compliance readiness for Managed services — including PCI DSS, ISO 27001, SOC 1/2/3, PDPA/GDPR — in close coordination with InfoSec teams. ...
Posted
13 days ago
Undisclosed

KL City

  • Manage cloud infrastructure provisioning and configuration using IaC tooling (Terraform, Helm), supporting both AWS/Azure cloud deployments and on-premises customer environments
  • Implement and maintain CI/CD pipelines for GFS solutions (Jenkins, etc.)
  • Work with Engineering teams to ensure security and compliance readiness for Managed services — including PCI DSS, ISO 27001, SOC 1/2/3, PDPA/GDPR — in close coordination with InfoSec teams ...
Posted
13 days ago
Undisclosed

Singapore, Singapore

  • Ensure all work carried out by the Maintenance team conforms to the required Safety standards and Procedures, e.g., PTW compliance, Risk assessment, MOC compliance, etc.
  • Drive Root Cause Analysis (RCA) and implement sustainable corrective actions
  • Own and manage key reliability KPIs, including but not limited to: ...
Posted
20 days ago
Undisclosed

Singapore

  • Experience with well-architected framework pillars (especially reliability, security, cost optimization).
  • Designing fault-tolerant and horizontally scalable systems
  • Advanced proficiency in Terraform, CloudFormation, or CDK ...
Posted
20 days ago

Amazon Innovation Center (Shenzhen) Company Limited - O82

Undisclosed

Taiwan

  • Lead and drive DFMEA, debugging, failure analysis, DOEs, and fixes for issues discovered during testing.
  • Evaluate and develop reliability test methodologies to reduce test time and increase test coverage.
  • Evaluate reliability risks and escalate them to the management team. ...
Posted
13 days ago
Undisclosed

Singapore

  • Engineering-driven culture with strong investment in cloud infrastructure, stability, and platform scalability
  • Responsibilities:
  • Ensure system reliability, scalability, and production stability across core business services ...
Posted
14 days ago
Undisclosed

Singapore

  • Build and maintain production tooling that supports deployment, orchestration, monitoring, and system diagnostics
  • Define and maintain observability, SLI/SLOs, and performance metrics in partnership with product owners
  • Leverage metrics and capacity planning to ensure scalability and uptime ...
Posted
20 days ago
Undisclosed

KL City

  • Handle major incidents and day-to-day operational issues, restore services efficiently, perform root cause analysis, and drive long-term improvements.
  • Design, develop, and maintain automated operations tools to improve efficiency and optimize operational workflows.
  • Provide 7×24 on-call technical support, with standard 5×8 working hours coverage. ...
Posted
20 days ago
Undisclosed
  • Review the Preventive Maintenance plan so that it stays dynamic and suited to the actual condition of the plant.
  • Define standards and procedures.
  • Coordinate the Maintenance Program across internal and external teams. ...
Posted
20 days ago
Undisclosed

Singapore

  • Experience with well-architected framework pillars (especially reliability, security, cost optimization).
  • Designing fault-tolerant and horizontally scalable systems
  • Advanced proficiency in Terraform, CloudFormation, or CDK ...
Posted
21 days ago
Undisclosed
  • Support change management review boards by supplying reliability data, analysis, and technical recommendations for approval decisions.
  • Lead or support reliability investigations, including qualification failures and backend process change assessments.
  • Serve as a key reliability interface to support customer audits, reliability reviews, and technical inquiries. ...
Posted
2 days ago