jobs in Expleo

Expleo Hiring! Full Time AWS Monitoring - Observability Engineer in - Ricebowl

AWS Monitoring - Observability Engineer

Expleo

Undisclosed

Singapore

Share
Save

Working Location

  • Singapore Singapore

Job Description

Responsibilities

Overview:
We are seeking a AWS Observability Engineer to implement, and standardize monitoring, alerting, logging, and dashboarding solutions across AWS environments. The role will be responsible for building reusable observability frameworks leveraging AWS-native services and Infrastructure-as-Code (IaC) practices.
The ideal candidate should have strong expertise in AWS CloudWatch, AWS Managed Grafana, Terraform, monitoring best practices, and operational visibility across cloud infrastructure and applications.
Qualifications:
Monitoring & Observability
  • Implement end-to-end monitoring solutions across AWS environments.
  • Configure and manage Amazon CloudWatch Metrics, Logs, Dashboards, and Alarms.
  • Implement monitoring standards, naming conventions, and reusable monitoring templates.
  • Configure log collection, retention policies, and centralized visibility.
  • Establish alerting standards and operational monitoring practices.
Grafana Dashboard Development
  • Develop reusable AWS Managed Grafana dashboards.
  • Build dashboards for:
    • EC2
    • ECS/Fargate
    • EKS
    • RDS
    • Lambda
    • API Gateway
    • Security Monitoring
  • Create executive, operational, infrastructure, and application dashboards.
  • Implement dashboard templating and multi-environment support.
Infrastructure as Code
  • Develop reusable Terraform modules for:
    • CloudWatch Alarms
    • SNS Topics
    • Grafana Dashboards
    • Monitoring Configuration
  • Ensure monitoring deployments are repeatable and scalable.
  • Maintain version-controlled infrastructure code repositories.
Security Monitoring
  • Integrate AWS Security Hub and GuardDuty into monitoring solutions.
  • Build security visibility dashboards.
  • Configure security alerting and notification workflows.
  • Support operational visibility of cloud security findings.
Alerting & Incident Visibility
  • Configure alert thresholds and alarm strategies.
  • Implement SNS-based notifications using email and SMS.
  • Define alert severity classifications.
  • Reduce alert noise through effective threshold management.
Documentation & Knowledge Transfer
  • Prepare implementation documentation.
  • Create operational runbooks and support procedures.
  • Conduct knowledge transfer sessions for operations teams.
  • Maintain implementation and configuration documentation.

Important Information

Never provide your bank or credit card details when applying for jobs. Do not transfer any money or complete unrelated online surveys. If you see something suspicious, Report this Job ad.

Learn More