jobs in Ampstek

全职 Machine Learning Engineer 工作, 薪水, Ampstek 公司招聘中 - Ricebowl

Machine Learning Engineer

Undisclosed

Singapore

分享
保存

工作地点

  • Singapore

职位描述

岗位职责

• Design and develop highly scalable, Real time systems using Hadoop ecosystem components(Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink and Nifi)


• Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell scripting for ingesting multi model data(image, audio, video, unstructured documents) with both batch and real-time.


• Develop full stack applications and internal engineering tools using Python, shell scripting, and modern web frameworks (e.g., Flask, React).


• Collaborate closely with data scientists to operationalize machine learning models using Cloudera Machine Learning (CML).


• Perform performance tuning and optimization of data applications on Hadoop to ensure optimal resource utilization.


• Experience working with ML platforms such as CML, Spark MLlib, and Python ML libraries (scikit learn, XGBoost), including model deployment.


• Design and develop highly scalable, Real time systems using Hadoop ecosystem components(Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink and Nifi)


• Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell scripting for ingesting multi model data(image, audio, video, unstructured documents) with both batch and real-time.


• Develop full stack applications and internal engineering tools using Python, shell scripting, and modern web frameworks (e.g., Flask, React).


• Collaborate closely with data scientists to operationalize machine learning models using Cloudera Machine Learning (CML).


• Perform performance tuning and optimization of data applications on Hadoop to ensure optimal resource utilization.


Total Experience


10+ yrs


Relevant Experience


6+ yrs


Mandatory skills


• Hadoop ecosystem (Spark, Hive, Kafka, Flink, NiFi, Iceberg, Trino)


• Java, Python, Spark (batch & real-time processing)


• Data ingestion & transformation frameworks


• Performance tuning on Hadoop platforms


• Shell scripting


• Real-time data processing systems


• ML model operationalization (CML / Spark ML)


Desired skills


• Full-stack development (Flask, React)


• Multi-model data processing (image, audio, video, unstructured)


• Cloudera Machine Learning (CML)


• ML libraries (scikit-learn, XGBoost)


• Data governance tools (Ranger)


• Ozone storage experience

重要安全守则

申请工作时,切勿提供您的银行或信用卡详细资料。不要转账或完成无关的在线调查问卷。如果您发现可疑内容,请举报此招聘广告。

了解更多