jobs in Exasoft

Exasoft Hiring! Full Time Infrastructure SRE Lead (Core Java 1.8,Spring Boot) in - Ricebowl

Infrastructure SRE Lead (Core Java 1.8,Spring Boot)

Undisclosed

Singapore

Share
Save

Working Location

  • Singapore

Job Description

Responsibilities

  • 8+years of engineering experience with strong infrastructure SRE background.
  • Excellent hands on experience in Core Java 1.8, Spring, Spring Boot, Quartz
  • Hands on experience with messaging systems such as RabbitMQ and IBM MQ
  • Strong working experience on Linux Operating Systems (Oracle Linux 7.6)
  • Extensive hands‑on expertise across cloud platforms, Kubernetes, IaC, automation, and observability.
  • Experience designing and optimizing enterprise infrastructure, high‑availability systems, and messaging platforms.
  • Prior experience guiding developers/SREs, conducting technical reviews, and driving engineering best practices.
  • Experience with Application Servers, preferably IBM WebSphere / Apache Tomcat 8.5.x
  • Excellent and proven experience in Oracle SQL and PL/SQL
  • Strong knowledge and experience in XML, XSL, XSLT Experience with Hardware Security Modules (HSM) such as PayShield 9000 / Shield XC High
  • Excellent knowledge of communication protocols such as REST, AMQP, JMS
  • Experience working with DevOps teams and industry standard CI/CD tools including:Jenkins, GitLab, Kubernetes, Docker, Terraform, Ansible, CloudFormation, Chef CI, GitHub Actions, Scripting using Python, Go, and Bash Cloud exposure on AWS, Azure, or GCP
  • Proven experience in supporting large, complex, high availability, high volume applications
  • Understanding of failover mechanisms and disaster recovery
  • Excellent communication and interpersonal skills


Key Responsibilities


  • Lead and mentor a team of SRE engineers and platform developers across distributed locations.
  • Translate architectural requirements into detailed technical solutions and engineering tasks.
  • Review and guide design, code, automation pipelines, Kubernetes deployments, and IaC modules.
  • Architect and optimize enterprise Linux environments, virtualization platforms, clustering, and NFS storage.
  • Lead deployment and tuning of messaging systems including IBM MQ, RabbitMQ, and Erlang/Mnesia engines.
  • Oversee implementation of load balancers (F5 LTM/ASM/ASR), proxies, networking, and hybrid connectivity.
  • Design, build, and optimize cloud and hybrid environments across AWS/Azure/GCP.
  • Lead Kubernetes architecture, resource organization, workload scaling, observability, and resilience patterns.
  • Guide container platform enhancements including service mesh, pod security, and cluster governance.
  • Architect IaC frameworks using Terraform, Ansible, CloudFormation, and Chef.
  • Develop reusable automation standards, modules, libraries, and operational workflows.
  • Drive automation for provisioning, patching, compliance, deployments, and operational tasks.
  • Build platform utilities, automation services, schedulers, orchestration engines using Core Java, Spring Boot, Quartz, and Erlang.
  • Develop integration modules for messaging systems, connectors, event pipelines, and internal micro‑services.
  • Implement operational tooling, log processors, alert engines, webhook handlers, and reliability frameworks.
  • Design and maintain observability pipelines using Prometheus, Grafana, Datadog, and Splunk.
  • Ensure reliability, performance, and cluster tuning across relational and NoSQL platforms.
  • Communicate technical decisions, risks, and progress clearly to stakeholders.
  • Support delivery governance and contribute to engineering roadmaps and improvement initiatives.



Technologies / Tools


  • Operating Systems & Virtualization
  • Enterprise Linux, VMware, OVM, X86 clusters
  • Containerization & Orchestration
  • Kubernetes, Docker
  • Application Development (Platform)
  • Core Java1.8, Spring, Spring Boot, Quartz, Erlang
  • Messaging
  • IBM MQ, RabbitMQ, Erlang/Mnesia
  • Infrastructure Automation
  • Terraform, Ansible, CloudFormation, Chef
  • Scripting
  • Python, Go, Bash
  • CI/CD
  • Jenkins, GitLab CI, GitHub Actions
  • Monitoring & Logging
  • Prometheus, Grafana, Datadog, Splun
  • Storage & Databases
  • Oracle, HA DB clusters, NFS, HPE Nimble, DataDomain
  • Load Balancing & Networking
  • F5 LTM/ASM/ASR, enterprise network concepts
  • File Transfer & Directory Services
  • GoAnywhere, Tivoli Directory Server
  • Cloud Platforms
  • AWS, Azure, GCP
  • Security
  • HSM (Payshield or equivalent)

Important Information

Never provide your bank or credit card details when applying for jobs. Do not transfer any money or complete unrelated online surveys. If you see something suspicious, Report this Job ad.

Learn More