- Kuala Lumpur, Kuala Lumpur Kuala Lumpur WP Kuala Lumpur Malaysia
Working Location
Job Description
Responsibilities
Location: Malaysia
Experience: 1–4 years
Employment Type: Full-time
We are looking for an AI / LLM Engineer with hands-on experience in building production-ready RAG and multimodal AI systems.
The ideal candidate should have experience working with LLMs, document AI, retrieval pipelines, vector databases, and multimodal workflows involving text, images, PDFs, or OCR-based systems.
This role requires practical engineering experience beyond tutorials or academic projects.
RequirementsMinimum 1 year of full-time experience in AI/LLM engineering
Hands-on experience with:
Retrieval-Augmented Generation (RAG)
Multimodal AI / Vision-Language Models (VLMs)
LLM application development
Experience building or deploying:
document AI systems
chatbot/agentic workflows
OCR + LLM pipelines
semantic search systems
AI APIs/services
Strong Python skills
Experience with at least some of the following:
LangChain / LangGraph
Vector databases (ChromaDB, FAISS, Qdrant, Pinecone, etc.)
Open-source LLMs
FastAPI / Flask
Docker
Hugging Face
Prompt engineering
Experience working with:
PDFs
images
OCR
embeddings
reranking
retrieval pipelines
Mandarin communication skills preferred
Agentic AI workflows
GraphRAG / Knowledge Graphs
Evaluation pipelines / hallucination detection
GPU inference optimization
Cloud deployment (AWS/GCP/Azure)
CI/CD or MLOps exposure
Design and develop RAG and multimodal AI applications
Build scalable AI pipelines for document and knowledge retrieval
Integrate LLMs with internal systems and APIs
Optimize retrieval quality, latency, and response accuracy
Work on production deployment, monitoring, and evaluation of AI systems
Collaborate with cross-functional teams to deliver enterprise AI solutions
Important Information
Never provide your bank or credit card details when applying for jobs. Do not transfer any money or complete unrelated online surveys. If you see something suspicious, Report this Job ad.