Site Reliability Engineer - Docker - Global Energy Co.

London - Onsite
This project has been archived and is not accepting more applications.

Description

A world-leading energy company has an exciting opportunity within Renewables and Energy Solutions for a Site Reliability Engineer to join a highly motivated and talented team delivering mission-critical software. This is a key role, helping to deploy, manage, troubleshoot and enhance the company's complex cloud-based services for a wide variety of customers.

As a Site Reliability Engineer, you will design and implement web applications and REST API services using a microservice-based infrastructure. The new technology stack includes AWS, Docker/Kubernetes, NoSQL and SQL databases, and a range of monitoring tools.

You will build innovative automated solutions and tools to help debug and resolve problems in production and prevent them from recurring. You will also proactively seek out system weaknesses and fix them before they cause production issues, by analysing monitoring data, watching trends, and applying Chaos Engineering.

Responsibilities
Keep your assigned site or service up and running, or get it back up and running quickly when a failure occurs
Work closely with internal partners and teams to ensure that they ship software that meets security, compliance, SLA, and performance requirements
Write, update and use documentation, including runbooks/playbooks
Automate work, including infrastructure needs, testing, failover solutions, and failure mitigation
Debug complex problems across an entire stack and create solid solutions
Develop CI/CD processes to improve cadence
Use Chaos Engineering to test what you build under real-world conditions

About you:
7 years' experience with software engineering, software development, or system operations
Know your way around a Unix/Linux Shell, can write Shell Scripts, and understand Linux internals
Experience debugging complex problems
Experience designing, building, and operating large-scale production systems
Know Python, Java, Go, Rust, or similar
Understand networking and messaging, especially between services
Hands-on experience using source control (Git, GitHub, GitLab) and feature branching strategies
Experience with a variety of open-source databases (Postgres, Redis etc.)

Preferred:
Experience with containers, such as Docker or Kubernetes
Experience with Elastic Stack, Istio, HashiCorp Vault, Prometheus, Grafana
Proven track record of automating processes and auto-healing work
Experience with monitoring and observability tools such as Datadog, Sensu, New Relic, and Nagios
Experience automating infrastructure, testing, and deployments using tools like Terraform and Helm, and the ability to explain the Infrastructure as Code paradigm
Experience with configuration management
Understand the idea behind Chaos Engineering, even if you haven't yet implemented it yourself
Experience working in regulated industries such as banking, telecoms, and power

These skills need to be combined with a positive attitude and an ability to work within a large, globally dispersed project team in a multi-cultural environment. You also need to be a self-starter, a logical thinker and a quick learner, with strong initiative and excellent communication, interpersonal and presentation skills, and able to write clearly and concisely.

We believe in equality of opportunity for all job applicants regardless of gender, marital status, race, colour, nationality, ethnic origin, creed or religion, disability, sexual orientation or age.

Specialising within Energy Trading, Oil & Gas, Financial Markets and TV & Entertainment, Eaglecliff Recruitment is ISO accredited, a Member of REC and listed within the top 4% for financial stability by Dun & Bradstreet. Please telephone for an immediate response or email your CV for a reply within one hour. Eaglecliff Ltd is acting in the capacity of an employment agency for permanent recruitment and an employment business for contractor resourcing.

Start date
2021-09-01
Duration
12 months initially
From
Eaglecliff Recruitment
Published at
22.07.2021
Project ID:
2167478
Contract type
Freelance