Our client is a global market infrastructure provider that supports the financial system through highly reliable trading and investment platforms. Operating across multiple regions and time zones, they enable continuous market access and focus on stability, performance, and innovation. Their teams collaborate closely to maintain high‑availability systems and drive ongoing improvements to critical financial technology.
Key Responsibilities
- Build, configure, and maintain Linux servers and storage platforms in global trading environments
- Manage OS lifecycle, configuration management (Salt), RPM packaging, and system updates
- Act as primary responder for Linux P1/P2 incidents, driving triage, RCA, and remediation
- Tune system performance and capacity for low‑latency, high‑availability environments
- Operate and support bare‑metal infrastructure, including hardware health and NIC/kernel tuning
- Design and implement automation using Python and Shell to reduce operational overhead
- Develop monitoring, dashboards, and operational reporting using Prometheus/Grafana
- Own Linux platform delivery for infrastructure projects end‑to‑end
- Participate in on‑call rotations, weekend maintenance, and failover testing
Requirements
- At least 5 years of experience supporting large‑scale (hundreds of servers), 24x7 Linux environments
- Strong Linux kernel expertise (scheduling, networking, I/O, performance tuning)
- Experience with configuration management (Salt/Puppet), RPM builds, and patching
- Automation skills using Python and Shell scripting
- Experience with bare‑metal systems; cloud (AWS/GCP) and Kubernetes exposure preferred
- Monitoring and observability experience (Prometheus, Grafana, or similar)
- Working knowledge of IaC/DevOps tools (Terraform, Git, CI/CD)
- Storage and capacity planning experience (SAN/NAS, NVMe, RAID)
- Methodical problem solver with strong attention to detail
Work Schedule & On‑Call
- Shift-based schedule to support follow‑the‑sun operations (shifts start as early as 5 AM)
- Participation in 24x7 on‑call rotations
- Periodic weekend maintenance and testing required