A leading financial technology firm is seeking a Site Reliability Engineer (SRE) to support low-latency trading platforms within a follow-the-sun model. Based in Singapore, this role ensures high availability, performance, and reliability of mission-critical systems during US Global Trading Hours, working closely with global engineering and operations teams.
Key Responsibilities
- Manage platform configuration, deployments, and change activities across production and DR environments
- Lead incident response, troubleshooting, and root cause analysis for real-time trading systems
- Monitor and optimize system performance, capacity, and availability
- Support low-latency infrastructure, including Linux tuning and networking stacks
- Drive automation and process improvements using scripting (e.g., Python)
- Perform data analysis and reporting using SQL and system logs
- Collaborate with software, network, and infrastructure teams globally
- Participate in on-call rotations and weekend testing activities
Requirements
- Bachelor’s degree in computer science or related field
- 3+ years in technical operations / SRE / production support
- Strong expertise in Unix/Linux (5+ years)
- Experience in systems, network, or database administration
- Proficiency in SQL and programming (Python, C++, etc.)
- Knowledge of networking (TCP/IP, multicast) is advantageous
- Strong communication skills with ability to operate independently