Period: 418 (start on 17 Mar 2025).
Deadline: 04 Feb 2025.
Duties/Roles:
Under the direction of the Service Delivery Manager of Service Oriented Architecture and Identity and Access Management (SOA & IdM), the engineer will perform duties such as the following:
1. Design and deploy highly available, secure, and scalable Kubernetes clusters on-premises or in the cloud.
2. Collaborate with development teams to containerize applications and deploy them on Kubernetes, optimizing for resource utilization and performance.
3. Champion and implement Infrastructure as Code (IaC) practices using tools like Terraform, Ansible, or Pulumi to manage and automate Kubernetes deployments and configurations.
4. Establish robust monitoring and logging solutions using tools like Prometheus, Grafana, and the ELK stack to ensure the health and performance of Kubernetes clusters and applications.
5. Continuously analyze and optimize Kubernetes cluster performance, identifying and resolving bottlenecks and resource constraints.
6. Implement security best practices and controls to safeguard Kubernetes clusters from threats and vulnerabilities.
7. Diagnose and resolve complex issues related to Kubernetes and containerized applications, providing timely solutions to incidents and outages.
8. Work closely with development, operations, and security teams to foster a collaborative environment and provide guidance on Kubernetes best practices.
Skills, Knowledge, Experience Required:
Mandatory:
1. The candidate must have a currently active NATO SECRET security clearance.
2. At least 5 years' experience in Kubernetes Orchestrator service management including:
3. Hands-on experience with Kubernetes in production environments, including designing, deploying, and managing large-scale clusters.
4. Deep understanding of containerization technologies such as Docker and Podman, and of container orchestration principles.
5. Proven experience with Infrastructure as Code tools like Terraform, Ansible, or Pulumi.
6. Familiarity with major providers (Rancher, OpenShift, Docker).
7. Familiarity with major cloud providers (AWS, Azure, GCP) and their Kubernetes offerings (EKS, AKS, GKE).
8. Familiarity with major virtualization providers (VMware, Microsoft).
9. Experience with monitoring and logging tools like Prometheus, Grafana, and the ELK stack.
10. Experience with service meshes like Istio or Linkerd.
11. Experience with Helm and CI/CD (Argo CD/GitOps).
12. Experience with storage (software-defined/backend) and S3 solutions such as MinIO and NooBaa.
13. Proficient in scripting languages like Bash or Python for automation tasks.
14. Excellent analytical and problem-solving skills with the ability to diagnose and resolve complex technical issues.
15. Strong communication and interpersonal skills with the ability to work effectively in a team environment.
16. Ability to manage multiple tasks and projects in a fast-moving environment, and to work in cross-functional teams.