Site Reliability Engineering Lead (SRE)

London  ‐ Onsite
This project has been archived and is not accepting more applications.
Browse open projects on our job board.

Description

Site Reliability Engineering Lead (SRE)

Our client, a leading global supplier for IT services, requires a Site Reliability Engineering Lead (SRE) to join their client's office in London. You can work remotely until covid abates.

This is a 12-month temporary contract, to start ASAP.

Key Responsibilities

  • CI/CD Automation (Build & Release)
  • Micro Services
  • Application Services
  • Automated Unit/Integration/Load Testing
  • Performance Testing & Monitoring
  • Logging, Monitoring & Alerting
  • Container Image Management & Security
  • Networks
  • Service Mesh
  • API Gateway Management
  • IAM - Identity & Access Management
  • Azure Policy Management
  • Cloud Security
  • Cost Management
  • To support our new platform, we understand that a wide range of skills are necessary, and those skills can only be satisfied through a self-organised and highly cohesive team

Key Requirements

  • 8 years of relevant work experience in critical production environments
  • Very strong DevOps skill set with expertise in integration tools like Jenkins, Azure DevOps, Nexus, and Git
  • Experience programming in at least two of the following languages: Java, Groovy, Scala, Python, Go, C++, JavaScript, PowerShell, or Shell
  • Provisioning components and services using ARM & Helm Templates
  • Kubernetes & Istio
  • Experience in some of these Azure technologies App Gateway, API Manager, AKS, Cosmos DB, Azure SQL, Azure Firewall
  • Building and Maintaining micro service architecture that with REST API's
  • Proficient in monitoring tools eg Azure Monitor/Log Analytics/Dynatrace/Splunk
  • Systematic problem-solving approach, strong communication skills and a sense of ownership
  • Working in an Agile environment
  • Experience architecting, developing, and troubleshooting large scale systems
  • Linux operating system level experience (eg filesystems, system calls)
  • Experience with algorithms and data structures
  • Terraform
  • DB provisioning and/or tuning
  • Apache Kafka
  • SQL, GraphQL, or Type Script
  • Automated Testing frameworks
  • Good command over written & verbal communication
  • Stakeholder Management and Leadership Skills

Start date
ASAP
Duration
12 months
From
Project Recruit
Published at
26.07.2021
Project ID:
2170445
Contract type
Freelance
To apply to this project you must log in.
Register