700+ Reliability Jobs - June 2026 - High Salaries

Showing 712 job results for "reliability"

Undisclosed
WFH

Singapore

  • Expert-level knowledge of handling production incidents in production-grade multi-cloud environments, following industry-standard incident management processes
  • Handling service requests and provisioning raised by customers.
  • Proven ability to manage customer escalations and drive resolution in mission-critical, high-impact production environments ...
Posted
a month ago
Undisclosed

Singapore

  • Identifying and owning automation opportunities across deployment, management, and observability
  • Participating in design reviews and operational readiness exercises
  • Working on a follow-the-sun model with regional and global colleagues ...
Posted
14 days ago
SGD4,500 - SGD4,500 Per Month

Singapore

  • Develop new FA capabilities/techniques to enhance detection capability or productivity or optimize recipes.
  • Drive improvement activities such as robustness and cycle time improvement as well as cost reduction activities.
  • Conduct FA staff training on FA techniques and procedures. ...
Posted
a month ago
Undisclosed

Singapore

  • Keep TikTok running smoothly across continents and time zones
  • Develop systems for mapping, capacity planning, disaster recovery, and incident automation
  • Test our systems with chaos engineering so they can handle anything thrown at them ...
Posted
9 hours ago
Undisclosed

Singapore

  • Measure and monitor availability, latency and overall service health.
  • Practice sustainable incident response and postmortems.
  • Participate in on-call rotations across continents. ...
Posted
2 days ago
SGD16,500 - SGD16,500 Per Month

Singapore

  • Ensure essential procedures are followed and contribute to defining standards
  • Integrate in-depth knowledge of applications development with overall technology function to achieve established goals
  • Provide evaluative judgement based on analysis of facts in complicated, unique, and dynamic situations including drawing from internal and external sources ...
Posted
3 days ago
SGD6,000 - SGD6,000 Per Month

Singapore

  • Root Cause and Resolution of Qual and RMA Device Issues pertaining to defectivity and intrinsic reliability: Debug and identify root causes of failures in reliability tests using Electrical Failure Analysis (EFA) and Physical Failure Analysis (PFA), and drive resolution and improvements through cross-functional team collaboration.
  • Process Conversions and HVM: The role involves providing recommendations to fab teams for new process conversions to reduce cost and increase yields. This is critical as it directly impacts the yield, quality, reliability, and performance of HBM products, which are key components in many modern technologies.
  • Risk Management: The role requires communication with Product Managers/Leads to manage risks associated with DPM process conversions. This is crucial in ensuring that the conversions move at an appropriate pace, balancing the need for innovation with the need for stability and reliability. ...
Posted
3 days ago
Undisclosed
  • Manage inventory for spare parts and sample tracking list
  • Perform lab housekeeping following the 7S standard
  • Shipping and collection of delivery items from store ...
Posted
a month ago
SGD10,000 - SGD10,000 Per Month

Singapore

  • Involved in implementation and rollout of high performance, large scale security platforms
  • Onboard and maintain expansive data pipelines for various security platforms
  • Involved in analysis and troubleshooting of security detections to minimize false positives and improve detection ...
Posted
23 days ago
SGD6,000 - SGD6,000 Per Month

Singapore

  • Participate in on-call rotations, major incident response, and operational escalations.
  • Define and maintain SLIs, SLOs, and error budgets for critical network services.
  • Develop operational standards, documentation, SOPs, and runbooks. ...
Posted
7 days ago
Undisclosed

Singapore

  • Drive supplier and product qualification and assessment processes in support of procurement teams. This includes developing technical approaches for evaluating new equipment and suppliers entering the AWS ecosystem
  • Drive continuous improvement initiatives that address both immediate quality issues and long-term performance optimization
  • Develop and implement Key Performance Indicators (KPIs) to monitor supplier quality performance and drive accountability ...
Posted
19 days ago
SGD5,000 - SGD5,000 Per Month

Singapore

  • Cross-Functional Collaboration: This role necessitates collaboration with various cross-functional teams such as Fab, HBM Technology Development, HBM Design, System Development, and Quality/Reliability team. This collaboration is vital for the holistic development and shipping of end products.
  • Data Analysis for Validation: Utilizing in-house statistical tools for engineering data analysis for validation and risk assessment provides valuable insights into the performance and reliability of the products.
  • Hardware Specifications and Validation: Defining interface hardware specifications and performing validation and debug of new interface boards ensures that the products are compatible with various hardware and can perform optimally in different environments. ...
Posted
8 days ago
Undisclosed

Singapore

  • Develop scripts, Infrastructure as Code (IaC), and internal tools to reduce manual intervention and improve operational efficiency and recovery times.
  • Act as a point of contact with the internal security team on security alerts arising from Security Information and Event Management (SIEM), ABLR, and GCSOC, and represent the team in security-related discussions.
  • Participate in infrastructure cyber hygiene reviews with relevant teams and ensure systems comply with recognised standards such as the Centre for Internet Security (CIS), National Institute of Standards and Technology (NIST), and Government Technology Agency Instruction Manual for ICT & SS Management. ...
Posted
15 days ago
Undisclosed

Singapore

  • Participate in on-call rotations, major incident response, and operational escalations.
  • Define and maintain SLIs, SLOs, and error budgets for critical network services.
  • Develop operational standards, documentation, SOPs, and runbooks. ...
Posted
8 days ago
Undisclosed

KL City

  • Lead every Sev1/2 incident, run the bridge, write the RCA within 48 hours, enforce blameless post-mortems the same week, and ship permanent automated fixes so the same outage never happens twice.
  • Review team members' code and scripts for adherence to code quality standards to ensure high-quality software delivery.
  • Evolve product observability, including metrics (Prometheus/Tempo), logs (Loki/CloudWatch), and traces (Tempo/OpenTelemetry), and proactively provide updates on design and implementation. ...
Posted
23 days ago
SGD7,000 - SGD7,000 Per Month

Singapore

  • Root Cause and Resolution of Qual and RMA Device Issues pertaining to defectivity and intrinsic reliability: Debug and identify root causes of failures in reliability tests using Electrical Failure Analysis (EFA) and Physical Failure Analysis (PFA), and drive resolution and improvements through cross-functional team collaboration.
  • Process Conversions and HVM: The role involves providing recommendations to fab teams for new process conversions to reduce cost and increase yields. This is critical as it directly impacts the yield, quality, reliability, and performance of HBM products, which are key components in many modern technologies.
  • Risk Management: The role requires communication with Product Managers/Leads to manage risks associated with DPM process conversions. This is crucial in ensuring that the conversions move at an appropriate pace, balancing the need for innovation with the need for stability and reliability. ...
Posted
8 days ago
Undisclosed

Singapore

  • Build availability of services deployed across multiple data centers globally.
  • Deliver tools/software to improve the reliability, scalability and operability of services.
  • Measure and monitor availability, latency and overall service health. ...
Posted
8 days ago
Undisclosed

Singapore

  • Performance management — track OEE and uptime KPIs across markets, flag deviations early, drive corrective action, and build local reporting habits so performance management becomes routine rather than reactive.
  • Currently pursuing or holding a Bachelor's degree in Engineering, preferably Electrical, Mechanical or Mechatronic Engineering, from a reputable university, with strong academic credentials. A master's degree is advantageous but not required. Candidates are expected to graduate by Jul 2026 and join us by Aug 2026.
  • Experience in technical engineering and program or project management, internship or full-time. ...
Posted
8 days ago
SGD7,000 - SGD7,000 Per Month

Singapore

  • Root Cause and Resolution of Qual and RMA Device Issues pertaining to defectivity and intrinsic reliability: Debug and identify root causes of failures in reliability tests using Electrical Failure Analysis (EFA) and Physical Failure Analysis (PFA), and drive resolution and improvements through cross-functional team collaboration.
  • Process Conversions and HVM: The role involves providing recommendations to fab teams for new process conversions to reduce cost and increase yields. This is critical as it directly impacts the yield, quality, reliability, and performance of HBM products, which are key components in many modern technologies.
  • Risk Management: The role requires communication with Product Managers/Leads to manage risks associated with DPM process conversions. This is crucial in ensuring that the conversions move at an appropriate pace, balancing the need for innovation with the need for stability and reliability. ...
Posted
9 days ago
SGD8,500 - SGD8,500 Per Month

Singapore

  • Set up, configure and integrate applications with middleware components as per the architecture documentation
  • Partner with infrastructure teams to implement security hardening measures and ensure compliance with regulatory requirements and industry standards
  • Design and execute comprehensive application security testing protocols and vulnerability assessment procedures, ensuring full alignment with internal framework requirements and organisational processes ...
Posted
24 days ago
Undisclosed
  • You participate in a 24/7 on-call rotation and drive improvements using SRE practices.
  • You actively participate in toil elimination, observability and monitoring improvements, knowledge management, error budget compliance, deployment designs and testing.
  • Bachelor’s degree and/or equivalent experience in Information Technology, Computer Science or Business Management. ...
Posted
a month ago
Undisclosed

KL City

  • Strong experience in site reliability engineering, infrastructure engineering or a similar role.
  • Strong knowledge of networks and protocols, network security and cloud networking
  • Proven track record of cloud cost optimisation ...
Posted
a month ago
Undisclosed

Singapore

  • Lead and drive device reliability/qualification activities and data correlation studies.
  • Collect, analyse and manage large & disparate data sets and distill information into concise presentations with possible recommendations as quick feedback to the engineering team.
  • Develop new FMEA techniques to improve detection capability, productivity or process optimization. ...
Posted
21 days ago
Undisclosed

Singapore

  • Pioneer and implement the next generation telemetry system for AIS services
  • Establish alert handling procedures, run-books, and collaborate with our global security team
  • Automate deployment and orchestration of services into the cloud environment as well as other routine processes ...
Posted
21 days ago
Undisclosed

Singapore

  • Design and implement solutions that are secure and compliant by collaborating with dedicated security teams, conducting regular audits, and integrating advanced vulnerability scanning tools.
  • Identify and resolve performance bottlenecks and operational issues, define and track KPIs (e.g., MTTR, system uptime, cost efficiency), and drive ongoing optimisation efforts.
  • Act as a technical advisor for tenants, guiding them on containerization and best practices for cloud-native deployments, and participate in strategic initiatives to enhance platform scalability and performance. ...
Posted
a month ago
Undisclosed

Singapore

  • Incident Response & RCA: Lead the response for complex virtualization, storage, or OS-level disruptions and conduct blameless post-mortems and Root Cause Analysis (RCA) to prevent systemic recurrence.
  • Systems Automation: Develop and maintain software tools (Python, PowerShell, Java) that automate infrastructure tasks via CI/CD pipelines to improve efficiency and reduce operational risk. ...
Posted
25 days ago
Undisclosed

Singapore

  • Establish best engineering practices for engineers as well as non-technical people.
  • Design and implement reliable, scalable, robust and extensible big data systems that support core products and business.
  • Bachelor's degree in Computer Science, a related technical field involving software or systems engineering, or equivalent practical experience. ...
Posted
25 days ago
MYR5,500 - MYR7,000 Per Month
  • Support Automation & Controls
  • Lead Equipment Setup & Improvements
  • Collaborate with Production Teams ...
Posted
25 days ago