Job Description
We are seeking an experienced Data Engineer to design, build, and maintain scalable data solutions that support business intelligence, analytics, and operational reporting. The ideal candidate will have strong expertise in data engineering, ETL/ELT development, cloud data platforms, and big data technologies.
Responsibilities
- Design, develop, and maintain data ingestion, transformation, and integration pipelines from multiple source systems.
- Build and optimize ETL/ELT workflows to ensure reliable and efficient data processing.
- Integrate structured and unstructured data from APIs, databases, files, and external platforms into centralized data environments.
- Develop and maintain scalable data models to support reporting, analytics, and business requirements.
- Perform data validation, reconciliation, and troubleshooting to ensure data accuracy and consistency across systems.
- Implement data quality frameworks, lineage tracking, metadata management, and governance controls.
- Optimize data pipelines and query performance for large-scale datasets.
- Collaborate with business stakeholders, analysts, and technical teams to gather requirements and deliver data solutions.
- Support User Acceptance Testing (UAT), production deployments, post-go-live monitoring, and continuous enhancement initiatives.
- Investigate and resolve data-related incidents, ensuring minimal disruption to business operations.
- Maintain technical documentation, data dictionaries, and operational procedures.
- Implement monitoring, security, and access control mechanisms across data platforms.
Requirements
- Minimum 4 years of experience in Data Engineering or a related field.
- Strong proficiency in Python and SQL.
- Solid understanding of data modeling, data warehousing concepts, and database design.
- Hands-on experience with big data processing technologies such as Apache Spark and/or Databricks.
- Proven experience designing and building ETL/ELT pipelines.
- Experience with cloud-based data platforms and storage solutions such as AWS S3, data lakes, or similar technologies.
- Experience integrating data from multiple sources, including REST APIs, databases, flat files, and third-party systems.
- Familiarity with data governance, lineage tracking, monitoring, and access control practices.
- Strong analytical, troubleshooting, and problem-solving skills.
- Excellent communication and stakeholder management abilities.
Nice to Have
- Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP).
- Knowledge of orchestration tools such as Airflow or similar workflow management solutions.
- Experience with data observability and monitoring tools.
- Exposure to CI/CD practices and DevOps methodologies.
- Experience working in Agile environments.
Preferred Skills
Python | SQL | Spark | Databricks | ETL/ELT | Data Modeling | AWS S3 | Data Lakes | APIs | Data Warehousing | Data Governance | Data Quality | Data Lineage | Cloud Data Platforms