Working closely with trading teams, risk management, business management and compliance to understand their needs. Acting as the main coordinator to make sure all requirements are met and appropriate solution is setup in production accordingly.
Solving problems by providing level one and level two support for our production systems.
Contributing to the design and implementation of the support system to enhance reliability and self-correction.
...
An understanding of infrastructure elements and deployment architecture is required.
A strong programming background and ability to deliver successful outcomes consistently on infrastructure automation solutions are essential in this role.
The candidate should have a good understanding of interfacing applications with vRA / vRO APIs.
...
Planning and coordination of shutdown maintenance activities of Rotating Equipment based on local and Global PM requirements, Vendor recommendation & Industrial Best practices.
Review risk analysis and maintenance procedures on regular basis.
Upkeep maintenance records for equipment modifications, repair and rerating on regular basis.
...
Managing the widely-deployed Order Management Systems and Market Data Delivery Systems involving every major electronic exchange and asset class.
Working closely with trading teams, risk management, business management and compliance to understand their needs. Acting as the main coordinator to make sure all requirements are met and appropriate solution is setup in production accordingly.
Solving problems by providing level one and level two support for our production systems.
...
A strong programming background and ability to deliver successful outcomes consistently on infrastructure automation solutions are essential in this role.
The candidate should have a good understanding of interfacing applications with vRA / vRO APIs.
A strong programming background and ability to deliver successful outcomes consistently on infrastructure automation solutions are essential in this role.
...
For our teams, we create an environment with opportunities for our people to succeed, backed by the culture and support to ensure they are enabled to truly own their careers. We are motivated individuals who tackle unique technical challenges at scale and solve them as a team. Together, we deliver innovative and ethical solutions that help businesses achieve their ambitions faster.
Site Reliability Engineer
We provide our merchants a single platform, capable of meeting the rapidly evolving needs of today's fast-growing global businesses. To meet the high expectations of our merchants, Adyen has adopted and embedded principles from the Site Reliability Engineering discipline, offering an environment whereby data-driven decisions, intellectual curiosity, problem solving and openness are key drivers for success.
...
A Day in the Life of a Lead / Senior Site Reliability Engineer:
For this role, you will play a key role in maintaining our cloud platform, which includes an assortment of Kubernetes, Microservices, MongoDB, RabbitMQ, MySQL, Windows Server VM Infrastructure, Orchestration Engines, CI/CD and Monitoring platforms. Your day will consist of:
Executing projects that rollout new platform maintenance features, automate tasks, or other big picture changes
...
As a Site Reliability Engineer (SRE) based in Singapore, you will play a critical role supporting our Blockdaemon team by ensuring the reliability, scalability, and performance of our systems and services. You will collaborate closely with cross-functional teams to design, implement, and maintain robust and resilient infrastructure solutions. The ideal candidate is passionate about automation, possesses strong analytical skills, and thrives in a fast-paced, dynamic environment.
Define metrics to evaluate system performance and runtime, improving observability. Plan system capacities to accommodate business growth and promotions.
Analyze production incidents to establish best practices for a highly available payment architecture.
At least 3 years relevant work experience from a large-scale systems.
...
Review architecture and software components with software engineers and architects, ensuring consistent best practices across all teams.
Own and ensure Service Level Objectives (SLOs) and Service Level Agreements (SLAs) are met, monitoring operational metrics and leading improvement plans.
Manage and audit security controls to meet enterprise requirements, collaborating with legal and compliance for risk management.
...
Planning and coordination of shutdown maintenance activities of Rotating Equipment based on local and Global PM requirements, Vendor recommendation & Industrial Best practices.
.
Review risk analysis and maintenance procedures on regular basis.
.
Upkeep maintenance records for equipment modifications, repair and rerating on regular basis.
.
...
Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.
...
Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.
To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.
...
Production Engineering is responsible for the world’s most reliable, observable, performant, and safe network ecosystem. Our customers rely on our products and systems to safely modify, troubleshoot, and release products without external impact.
Our external customers rely on us to provide seamless and predictable incident, traffic, policy management, resulting in the fastest and safest network services in the world.
We are accountable for the overall performance of internal and external facing services, guiding our product teams to optimal configurations and maximum efficiency. From the moment that a packet enters the Cloudflare ecosystem, we know exactly what its expected purpose and behavior is and we are capable of determining and exposing anomalous behavior.
...
Define metrics to evaluate system performance and runtime, improving observability. Plan system capacities to accommodate business growth and promotions.
Analyze production incidents to establish best practices for a highly available payment architecture.
At least 3 years relevant work experience from a large-scale systems.
...