75% Remote: Infrastructure Delivery Lead (f/m/d)

Land Berlin, Berlin  ‐ Hybrid
This project has been archived and is not accepting more applications.
Browse open projects on our job board.

Keywords

Automation Quality Management Ansible Workflows Scripting Infrastructure as Code (IaC) Gitlab Jenkins Test Automation Bash Shell Databases Continuous Integration Data Centers Elasticsearch Infrastructure As A Service (IaaS) Python (Programming Language) Product Management Resource Management Datadog Data/Record Logging Test-Driven Development (TDD) Fluentd Grafana Kubernetes Kibana Terraform Splunk Ansible Tower

Description

For our client we are looking for a Infrastructure Delivery Lead (f/m/d).

Frame data:
Start: November 2024
Duration: 31.03.25 ++ (long term)
Capacity: 100% if possible
Location: Berlin / Remote (1 week Berlin / 3 weeks remote in rotation), up to 50% onsite in peak times

Role:
- Infrastructure product delivery is divided into two teams: Infrastructure Core with datacenter, network, compute & storage delivery and – in the system stack on top of those - infrastructure IaaS (ICE = infrastructure core engine) delivery.
- The Capabilities Lead is a part of the ICE Delivery leadership team.
- The Capabilities Lead is tasked with ensuring the delivery of products specified by product management and co-designed by the architecture team. This entails creating high-level and low-level designs, completing engineering tasks, developing a working product, and ensuring quality assurance.
- Scrum rituals and backlog grooming are part of the workload management in this role.
- Besides engineering activities, the Capabilities Lead is looking after the engineers in his team, their performance and development.
- This role is responsible for setting the vision of where the team should be in a
couple of years, as well as executing tactical initiatives, to enable and grow the team. Operational activities like timesheet and resource management round the responsibilities of the role.

Targets:
- Engineering Excellence: An engineering background with an emphasis on designing and implementing long term secure solutions rather than quick fixes.
- CI/CD and Automation: Proficiency in integrating continuous testing practices within the CI/CD pipeline using tools like GitLab, Jenkins, Tekton, Argo Workflows, and ArgoCD.
- Scripting and Automation: Solid foundation in scripting and automation with a security-conscious approach, ensuring that security & reliability practices are integrated into automated processes.
- Robust Infrastructure: Proficiency in overseeing the reliability of infrastructure components like servers, networks, and cloud resources, with a strong focus on Infrastructure as Code (IaC) to ensure continuous deployments.
- Security Measures: Proficiency in securing the system using SSL/TLS encryption for data protection, experience with secret stores like Hashi Corp Vault, and an understanding of the zero-trust and the least privilege security concept.

Skills (must-have):
- Kubernetes Expertise:
Thorough knowledge of how Kubernetes operates internally, which involves setting it up, adjusting its size, fixing issues, and managing various Kubernetes versions. This also includes the use of Helm charts. Understanding how its networking and storage functions work on a low level. Having experience with cloud provided (like AKS, GKE) as well as self-managed/on-prem k8s clusters
- Infrastructure Management:
Experience in administering infrastructure components and proficiency in Infrastructure as Code (IaC) is necessary. A strong grasp of how data is stored, shared, and secured (storage) and how information moves within environments perspective is necessary. Good understanding of ingress and egress flows both internally and externally including interconnect across the provisioned estate. Tooling like Terraform, Terragrunt, Shell, Bash, Python are well known.
- CI/CD:
Proven ability to set up and manage CI/CD pipelines using tools like GitLab, Jenkins, Tekton, Argo Workflows, and ArgoCD. Design and implement fully automated Build and Deployment pipelines.
- Observability Systems:
Proficiency in setting up monitoring and logging systems for real-time insights into system performance. Familiarity with tools like Prometheus, Grafana, and optionally other similar stacks (e.g., EFK - Elasticsearch, Fluentd, Kibana) and monitoring technologies like Splunk, Datadog, etc.
- Quality Assurance:
Demonstrated commitment to ensuring high-quality deliverables through rigorous standards and processes, by employing TDD and/or BDD methodologies.
- Fluent English in speech and writing (at least C1)

Skills (should-have):
- Experience in following areas is optional and welcome: Infrastructure Test Automation
- Experience in following areas is optional and welcome: Graph Databases (like Neo4j, ArangoDB, …)
- Experience in following areas is optional and welcome: Ansible, Ansible Automation Platform, Ansible Tower
Start date
11.2024
Duration
12 months
(extension possible)
From
Nemensis AG
Published at
12.09.2024
Contact person:
Jens Eberling
Project ID:
2795428
Industry
IT
Contract type
Freelance
Workplace
80 % remote
To apply to this project you must log in.
Register