Work Location: Kuala Lumpur, Federal Territory, Malaysia
Job Description
Job Responsibilities
Company Description
B & H Software House BV is an international technology solutions provider specializing in end-to-end custom software development, digital transformation, and technical program execution. Originally incorporated in the Netherlands, the company now operates as a cross-border agile delivery house with strong centers across Western Europe and Southeast Asia, including Malaysia. B & H Software House BV partners with a diverse global client base, from high-growth startups to mid-market enterprises, to deliver scalable and production-ready software solutions. The organization focuses on bridging complex business strategies with practical, high-performing technology implementations. Team members join a dynamic, multicultural environment with opportunities to work on impactful, global projects.
We are looking for a seasoned DevOps Engineer to own and optimize our enterprise Azure Cloud platform, which supports a microservices architecture. You will drive operational governance, cost efficiency (FinOps), platform resilience, advanced observability, and reliable API/traffic management while ensuring high availability, security, and performance for mission-critical applications. This is a high-impact, hands-on role focused on Infrastructure as Code, GitOps, incident management, and continuous improvement in a collaborative environment.
Key Responsibilities
• Azure Operational Governance & FinOps:
o Drive cost optimization and governance initiatives, including defining cost baselines, setting up alerts, right-sizing resources, and optimizing Azure Front Door and Azure Load Balancer configurations. Ensure cost/performance trade-offs are transparent and well-communicated to stakeholders.
• Platform Resilience & Disaster Recovery:
o Own backup, restore validation, and disaster recovery strategies for Azure services and databases (Azure SQL, PostgreSQL, MongoDB, Cosmos DB). Define and meet RPO/RTO targets and conduct regular DR testing and drills.
• Observability & Monitoring:
o Implement and continuously enhance observability using Datadog (metrics, logs, traces, dashboards, SLOs) integrated with Azure telemetry. Enable proactive issue detection, faster troubleshooting, and reduced MTTR.
• Incident & Problem Management:
o Lead and actively participate in incident response (triage, mitigation, communication, post-incident reviews) and problem management to prevent recurrence. Maintain and improve operational runbooks and automate repetitive tasks.
• API Gateway & Traffic Management:
o Operate and troubleshoot the traffic and API layer, including Azure Front Door, Azure Load Balancer, and Kong Gateway (routing, certificates, plugins, policies) to ensure secure, reliable, and high-performance connectivity for microservices.
• Infrastructure as Code & GitOps:
o Develop and maintain infrastructure using Terraform. Implement and manage GitOps workflows with FluxCD for declarative, reliable deployments.
• Database Operations:
o Manage provisioning, scaling, performance tuning, backup, and high availability for Azure SQL, PostgreSQL, MongoDB, and Cosmos DB.
• Security & Compliance:
o Integrate security practices using Wiz, Splunk, and Black Duck for vulnerability management, logging, and secure configurations.
• Documentation & Knowledge Sharing:
o Create and maintain comprehensive platform documentation (architecture diagrams, IaC modules, runbooks, monitoring guides) and contribute to engineering standards across teams.
• Collaboration & Ownership:
o Demonstrate strong ownership and a customer-first mindset. Collaborate effectively with product teams, developers, SREs, and security stakeholders. Communicate clearly during incidents and changes while driving continuous improvement.
Requirements
Email us at *************