- Singapore
Working Location
Job Description
Responsibilities
Role Description
This is a technical role for an LLM Infrastructure Engineer, who designs, supports, and optimizes infrastructure for large language model systems and AI applications. The engineer maintains scalable computing environments, supports deployment pipelines and model deployment processes, and monitors system metrics and infrastructure performance to keep AI platforms and services reliable, efficient, and stable. The work also includes collaborating with engineering and research teams, optimizing compute and storage resources, supporting cloud-based operations, troubleshooting infrastructure issues, improving automation workflows, maintaining technical documentation, and contributing to continuous improvement of AI infrastructure scalability and platform reliability. The position requires strong technical problem-solving skills, attention to detail, and the ability to manage complex infrastructure operations in a fast-paced environment.
Qualifications
• Strong understanding of infrastructure engineering, cloud computing, and distributed system operations
• Knowledge of AI infrastructure environments, deployment pipelines, and scalable system architecture concepts
• Ability to support large-scale infrastructure operations, monitor system performance, and troubleshoot technical issues effectively
• Familiarity with cloud platforms, containerization technologies, orchestration tools, and workflow automation practices
• Strong analytical and problem-solving skills with attention to detail and operational efficiency
• Ability to collaborate with engineering, research, and operational teams in a technical environment
• Knowledge of infrastructure monitoring, system optimization, and operational reliability practices
• Familiarity with scripting, automation tools, and infrastructure management processes is an advantage
• Strong organizational and communication skills with the ability to manage multiple technical tasks efficiently
• Ability to work independently while contributing effectively within a collaborative engineering environment
• Professional attitude, adaptability, and commitment to maintaining reliable and scalable AI infrastructure operations