Period: 418 (start on 17 Mar. 2025).
Deadline: 04 Feb. 2025.
Duties/Roles:
Under the direction of the Service Delivery Manager of Service Oriented Architecture and Identity and Access Management (SOA & IdM), the engineer will perform duties such as the following:
1. Design and deploy highly available, secure, and scalable Kubernetes clusters on-premises or in the cloud;
2. Collaborate with development teams to containerize applications and deploy them on Kubernetes, optimizing for resource utilization and performance;
3. Champion and implement Infrastructure as Code (IaC) practices using tools like Terraform, Ansible, or Pulumi to manage and automate Kubernetes deployments and configurations;
4. Establish robust monitoring and logging solutions using tools like Prometheus, Grafana, and ELK stack to ensure the health and performance of Kubernetes clusters and applications;
5. Continuously analyze and optimize Kubernetes cluster performance, identifying and resolving bottlenecks and resource constraints;
6. Implement security best practices and controls to safeguard Kubernetes clusters from threats and vulnerabilities;
7. Diagnose and resolve complex issues related to Kubernetes and containerized applications, providing timely solutions to incidents and outages;
8. Work closely with development, operations, and security teams to foster a collaborative environment and provide guidance on Kubernetes best practices.
Skills, Knowledge, Experience Required:
Mandatory:
1. The candidate must have a currently active NATO SECRET security clearance;
2. At least 5 years' experience in Kubernetes orchestrator service management, including the areas listed below;
3. Hands-on experience with Kubernetes in production environments, including designing, deploying, and managing large-scale clusters;
4. Deep understanding of containerization technologies like Docker and Podman, and of container orchestration principles;
5. Proven experience with Infrastructure as Code tools like Terraform, Ansible, or Pulumi;
6. Familiarity with major providers such as Rancher, OpenShift, and Docker;
7. Familiarity with major cloud providers (AWS, Azure, GCP) and their Kubernetes offerings (EKS, AKS, GKE);
8. Familiarity with major virtualization providers (VMware, Microsoft);
9. Experience with monitoring and logging tools like Prometheus, Grafana, and the ELK stack;
10. Experience with service meshes like Istio or Linkerd;
11. Experience with Helm and CI/CD (ArgoCD/GitOps);
12. Experience with storage (software-defined/backend) and S3 solutions (MinIO, NooBaa, etc.);
13. Proficient in scripting languages like Bash or Python for automation tasks;
14. Excellent analytical and problem-solving skills with the ability to diagnose and resolve complex technical issues;
15. Strong communication and interpersonal skills with the ability to work effectively in a team environment;
16. Ability to manage multiple tasks and projects in a fast-moving environment and to work in cross-functional teams.