Hey there! We are looking for a Cloud Operations Engineer with experience in Kubernetes and Azure- or other- cloud platform to join our Cloud & Operations Team! Is this you?
As a Cloud Operations Engineer you will be building, running and supporting the cloud infrastructure that supports our Platform. Our product is a software platform that allows developers to build Bots and Conversational AI solutions in a unified coding and analytics IDE. The company is at the forefront of Generative AI and Large Language Models (LLM) technology, including GPT, which allows us to deliver superior customer service solutions. You are going to be part of a team of system and support engineers - delivering world-class technical and application support to our dynamic and growing customer base using English as the daily working language.
This is a full-time position, hybrid or remote and ideally located in Spain, with a requirement to be part of a 24x7 on-call rota, providing support to critical failures within our customer production environments.
Wondering what you will be doing on a “normal day”?
* Day to day operation of our cloud ☁️ (currently Azure, Kubernetes based platform),
* Day to day operation of our product based in cloud ☁️ (Traditional VM infrastructure in AWS controlled via CloudFormation),
* Provide install support and ongoing operational support for our on-premise customers (Typically VM hosted Linux systems)
* Proactively monitor and maintain environments ensuring optimal uptime,
* Deployment of infrastructure components in Kubernetes using k8s operators, helm charts etc,
* Troubleshoot any incidents which may arise,
* Develop our platform cloud - participate in projects within the wider team around areas of your interest and expertise (monitoring, scalability, security, testing, processes, compliance (ISO 27001, SOC2), automation, optimization, technical documentation, new technology labs and research).
1. What are we looking for in you?
* 5 to 10 years of experience from similar positions, using English as working language,
* Solid experience working in a Production environment with a clear understanding of risks, risk analysis and operational responsibility,
* Solid experience with Azure or other cloud like AWS, Google..,
* Solid experience with Kubernetes (a plus: experience running Java applications in Kubernetes),
* Experience with VM management, package and configuration deployment via orchestration systems like Salt or Puppet,
* Knowledge and experience with backup scheduling, vaulting, and retention. Preferably for both system and application data backups,
* Experience with implementing Nagios, or other similar monitoring system,
* Experience of at least one scripting or programming language (such as Go, Python),
* Experience with writing technical and process documentation,
* Strong analytical, multi-tasking and problem-solving skills,
* Ability to independently work on projects and tasks,
Meriting requirements
* Experience working with Java /JVM based apps,
* Experience in setup and management of cloud architectures through Infrastructure As Code (a plus: experience with Pulumi, Terraform),
* Experience working in CI/CD pipelines with GitLab or Jenkins,
* Experience with writing test cases and test methodology.