Key Responsibilities
- Design conceptual, logical, and physical data models for operational and analytics use cases
- Define and manage schemas for both OLTP (operational) and OLAP (analytics/serving) systems
- Establish and maintain database standards including naming conventions, keys, constraints, and referential integrity
- Optimize database performance through indexing, partitioning, query tuning, and materialized views
- Develop and maintain scalable ETL/ELT pipelines for structured and unstructured data
- Implement batch and near real-time data ingestion workflows
- Build data flows across Raw → Staging → Serving layers, including retention and reprocessing logic
- Ensure pipelines are fault-tolerant, idempotent, observable, and production-ready
- Implement data validation and quality checks (reconciliation, duplicate detection, completeness rules)
- Maintain end-to-end data lineage (source → transformation → output layers)
- Implement operational logging and metrics (job status, failure tracking, throughput, latency, processing time)
- Support data compliance requirements including auditability, traceability, and access logging
- Support backup and replication in line with disaster recovery requirements
- Work closely with architects, backend engineers, DevOps, security, and product teams
Requirements
- 5+ years of experience in Data Engineering or backend data systems
- Strong SQL expertise with PostgreSQL (schema design & performance tuning)
- Experience owning data pipelines end to end in production environments
- Strong data modelling skills (normalized OLTP and analytical/serving models)
- Proficiency in Python (or a similar language) for ETL/pipeline automation
- Experience with object storage systems (e.g., MinIO / S3)
- Solid understanding of indexing, partitioning, and materialized views
- Experience with monitoring, logging, and production reliability practices
- Strong understanding of data lifecycle and data quality principles
Preferred Qualifications
- Bachelor’s Degree in Computer Science, Data Engineering, Software Engineering, or related field
- Master’s Degree is an added advantage
- Familiarity with OpenSearch / Elasticsearch
- Exposure to AI/ML data workflows or AI-driven systems
- Experience in secure, enterprise, or regulated environments
Job Types: Full-time, Contract
Contract length: 12 months
Pay: RM7,000.00 - RM9,000.00 per month
Benefits:
- Flexible schedule
- Opportunities for promotion
- Professional development
- Work from home
Application Question(s):
- Do you have experience working on AI data projects or data feeding pipelines for AI/ML systems?
- Have you used GitHub Copilot or similar AI-assisted development tools before?
Work Location: In person