The Infrastructure Lead is a senior player-manager responsible for owning the infrastructure strategy, architecture, and roadmap. This role leads the infrastructure team with strategic oversight and handson technical capability, ensuring platform reliability, security, cost-efficiency, and business continuity, underpinned by modern DevSecOps practices.
Key Responsibilities
Infrastructure Strategy, Governance, Provisioning & Operations
- Own and drive the infrastructure strategy, platform decisions, and technology roadmap in alignment with organisational objectives.
- Govern infrastructure costs, capacity planning, and IaC change controls across all environments.
- Ensure infrastructure controls are documented and evidenced to support the ISO 27001:2022 certification programme.
- Build and configure cloud infrastructure using IaC tooling; manage networking, secrets, and certificate lifecycle across all environments.
- Maintain production infrastructure stability, manage Kubernetes clusters, and execute patch management within defined SLAs.
Team Leadership
- Lead the infrastructure team as a player-manager, remaining hands-on and technically capable.
- Provide mentorship to the Senior Cloud Engineer and infrastructure staff; cover on-call duties as required.
- Manage third-party infrastructure service providers, including SLA governance and performance management.
Business Continuity, Disaster Recovery and Support
- Establish and maintain the DR plan across all infrastructure systems, ensuring it is documented, tested, and evidenced.
- Collaborate with the BCM on BCP and produce DR test records as ISO 27001:2022 audit evidence.
- On-call availability is required to support 24x7 incident response coverage.
Requirements
Experience
- 10+ years of hands-on infrastructure or platform engineering experience, with 3+ years in a leadership role.
- Proven experience managing production infrastructure including 24x7 on-call incident response and DR execution.
- Demonstrated experience designing and delivering DR and BCP programmes.
- Experience managing third-party infrastructure or managed service providers; ISO 27001:2022 frameworks exposure.
- Background in regulated industries (financial services preferred); on-premise or hybrid infrastructure experience advantageous.
Technical Skills
- Cloud infrastructure: architecture, provisioning, governance, and security hardening (AWS, Azure, or GCP)
- Infrastructure as Code (IaC): Terraform, or equivalent
- Networking: LAN/WAN, VPCs, security groups, firewall rules, DNS, and TLS certificate management
- Container orchestration: Kubernetes cluster management and CI/CD pipeline infrastructure
- DevSecOps practices: GitOps workflows, infrastructure-as-code governance, and platform engineering standards
- Observability: monitoring, alerting, and on-call incident management for 24x7 environments
- Secrets management, patch management, and configuration management at scale
Qualifications
- Bachelor's degree or higher in Computer Science, IT, Engineering, or a related discipline.
- Cloud platform certifications (AWS, Azure, or GCP) strongly preferred; ITIL or TOGAF advantageous.