jobs in Hyphen Connect Limited

Hyphen Connect Limited Hiring! Full Time Synthetic Data Engineer (AI Data-Training) in Hong Kong - Ricebowl

Synthetic Data Engineer (AI Data-Training)

Hyphen Connect Limited

Undisclosed

Hong Kong

Share
Save

Working Location

  • Hong Kong Hong Kong

Job Description

Responsibilities

We are seeking a talented and innovative Synthetic Data Engineer. In this role, you will design and implement domain-specific synthetic data generation pipelines, ensuring high-quality data management for training loops. Your expertise will drive the success of data processing and model training within the organization.


Responsibilities:

  • Design domain-specific synthetic data generation (SDG) pipelines via self-instruct and constitutional prompting.
  • Implement automated quality scoring and de-duplication systems.
  • Manage data pipelines that feed directly into SFT and DPO training loops.

Qualifications:

  • Proven experience building large-scale data pipelines (Airflow, Spark, Ray).
  • Deep knowledge of prompt engineering for data generation.
  • Familiarity with dataset distillation and bias mitigation.

Important Information

Never provide your bank or credit card details when applying for jobs. Do not transfer any money or complete unrelated online surveys. If you see something suspicious, Report this Job ad.

Learn More