Job Summary
We are seeking a highly skilled AI Algorithm Engineer specializing in Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) to design, develop, and optimize intelligent systems. The ideal candidate will work on cutting-edge AI solutions, improving model performance, and building scalable applications for real-world use cases.
Key Responsibilities
- Design, develop, and optimize LLM-based applications and RAG pipelines
- Build and maintain end-to-end AI systems, including data ingestion, embedding, retrieval, and generation
- Fine-tune and evaluate large language models for domain-specific tasks
- Implement prompt engineering, chain-of-thought strategies, and agent workflows
- Develop vector search solutions using tools such as FAISS, Pinecone, or Weaviate
- Optimize system performance (latency, accuracy, cost efficiency)
- Collaborate with product managers, data engineers, and backend teams to deliver scalable solutions
- Conduct experiments, A/B testing, and model evaluation using relevant benchmarks
- Ensure AI systems comply with data privacy, security, and ethical standards
Requirements
- Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, or related field
- 3–6 years of experience in machine learning, NLP, or AI engineering
- Strong experience with LLM frameworks (e.g., OpenAI API, Hugging Face, LangChain, LlamaIndex)
- Hands-on experience with RAG architecture and vector databases
- Proficiency in Python and ML libraries (PyTorch, TensorFlow, Scikit-learn)
- Experience with cloud platforms (AWS, GCP, or Azure)
- Solid understanding of NLP, embeddings, and information retrieval techniques
- Strong problem-solving skills and ability to work in a fast-paced environment