Platform Engineer – Job Description
The Software Engineer is responsible for designing, building, and maintaining scalable, secure, and automated platforms that enable high‑velocity software delivery across the enterprise. This role blends cloud engineering, DevOps, automation, infrastructure‑as‑code, and platform reliability to provide a seamless developer experience. The Platform Engineer works closely with development, security, and operations teams to create self‑service, resilient, and observable platforms.
Key Responsibilities
- Design, build, and maintain cloud‑native platforms (AWS/Azure/GCP) that support microservices, APIs, and distributed systems.
- Develop and maintain Infrastructure as Code (IaC) using Terraform, ARM, CloudFormation, or Pulumi.
- Build CI/CD pipelines and automation frameworks to accelerate application delivery.
- Implement and manage container platforms such as Kubernetes, OpenShift, or AKS/EKS/GKE.
- Ensure platform scalability, reliability, security, and performance through best practices.
- Develop self‑service capabilities for developers (templates, pipelines, reusable modules).
- Implement monitoring, logging, and observability using tools like Prometheus, Grafana, ELK, Splunk, or Datadog.
- Collaborate with security teams to enforce DevSecOps, compliance, and governance.
- Troubleshoot platform issues, perform root‑cause analysis, and drive continuous improvement.
- Participate in architectural discussions and contribute to platform strategy and roadmaps.
- Mentor junior engineers and promote engineering excellence across teams.
Required Skills & Experience
- Bachelor’s degree in Computer Science, Engineering, or related field.
- 8-14 years of experience in DevOps, cloud engineering, or platform engineering.
- Strong expertise in:Kubernetes, Docker, container orchestrationTerraform, IaC, GitOpsCI/CD tools (Azure DevOps, Jenkins, GitHub Actions, GitLab CI)Cloud platforms (AWS, Azure, or GCP)
- Hands‑on experience with:Linux administration, networking, load balancingService mesh (Istio, Linkerd) and API gatewaysMonitoring & observability (Prometheus, Grafana, ELK, OpenTelemetry)Scripting (Python, Bash, PowerShell)
- Strong understanding of:DevOps practices, SRE principles, automationSecurity frameworks, secrets management, identity & accessHigh availability, scalability, and distributed systems