- Singapore
工作地点
职位描述
岗位职责
Role Description
This is a technical role for an LLM Infrastructure Engineer responsible for designing, supporting, and optimizing infrastructure environments for large language model systems and AI applications. The LLM Infrastructure Engineer will be responsible for maintaining scalable computing environments, supporting deployment pipelines, monitoring infrastructure performance, and ensuring the reliability, efficiency, and stability of AI platforms and services. Additional responsibilities include collaborating with engineering and research teams, optimizing compute and storage resources, supporting cloud-based operations, troubleshooting infrastructure issues, and improving automation workflows to enhance operational efficiency. The role also involves monitoring system metrics, supporting model deployment processes, maintaining technical documentation, and contributing to continuous improvements for scalable AI infrastructure and platform reliability. The position requires strong technical problem-solving skills, attention to detail, and the ability to manage complex infrastructure operations in a fast-paced environment.
Qualifications
• Strong understanding of infrastructure engineering, cloud computing, and distributed system operations
• Knowledge of AI infrastructure environments, deployment pipelines, and scalable system architecture concepts
• Ability to support large-scale infrastructure operations, monitor system performance, and troubleshoot technical issues effectively
• Familiarity with cloud platforms, containerization technologies, orchestration tools, and workflow automation practices
• Strong analytical and problem-solving skills with attention to detail and operational efficiency
• Ability to collaborate with engineering, research, and operational teams in a technical environment
• Knowledge of infrastructure monitoring, system optimization, and operational reliability practices
• Familiarity with scripting, automation tools, and infrastructure management processes is an advantage
• Strong organizational and communication skills with the ability to manage multiple technical tasks efficiently
• Ability to work independently while contributing effectively within a collaborative engineering environment
• Professional attitude, adaptability, and commitment to maintaining reliable and scalable AI infrastructure operations
重要安全守则
申请工作时,切勿提供您的银行或信用卡详细资料。不要转账或完成无关的在线调查问卷。如果您发现可疑内容,请举报此招聘广告。