Description
Ework is direct supplier to our client - there is no middle layer. This means that we have a good contact with procurement and in many cases, know the stakeholder and his needs.
For our client we are looking for a Site Reliability Engineer
Main responsibilities:
Be first point of entry for production support and service transition
Operate the platform in production including incident and problem management
Ensure that the MTTR and MTBF targets are met
Proactive monitoring and event management of the infrastructure
Manage infrastructure change and releases along with impact analysis
Work on Terraform automation and engineering together with the infrastructure team
Diagnose and troubleshoot issues rising from automation
Support the onboarding of new users and applications
Analyse and implement compliance requirements
User request fulfilment (eg setup Linux accounts, Consul configuration, cloud account setup)
Create operational documentation (KMS, SOPs, BCM)
Define SLAs, SLOs and SLIs and provide automated reporting of availability and error budget
Support on maturing the continuous deployment capabilities
Work closely on implementation roadmap with the development team
Support with continuous service improvements on automation, reliability and resiliency
Perform operational readiness assessments
Technology Skills:
Terraform
Attlasian Stack
Ansible, Puppet
Linux
Docker
Prometheus/Grafana/App Dynamics
Javascript, Phython, Golang
Consul
Hashicorp Vault
Experience with Datasynapse Grid is an asset
Start: January 2nd
Duration: December 20th 2019
Location: Copenhagen
Work load: 100%
Working language: English
We would like to thank you in advance for your application. However, due to a resource issue, we would like to inform you, that only the most relevant candidates will receive feedback.