Description
As SRE you combine software and system engineering to ensure the services run smoothly.
You may be part of few projects partnering with product development for the whole life cycle with different levels of engagement.
You will spend most of the time building operational tooling, automating operational workflows, performing architecture and design reviews, investigating system failures and complex outages, improving the monitoring infrastructure, and defining service level objectives.
Your main responsibilities:
- You know how to build robust, fault-tolerant and highly scalable Web Services that support our global growth
- Write, review and ship software to improve the availability, scalability, latency, and efficiency of our services.
- Engage in service capacity planning and demand forecasting, software performance analysis and system tuning.
The most important experience for this position:
- Good programming skills in one or more of: Python, Go, Java, Scala, Javascript and open to pick up new ones.
- Strong hands-on experience with infrastructure automation tools and orchestrators (the whole Hashicorp stack, SaltStack, etc.)
- Strong expertise with monitoring frameworks (Grafana, Kibana, Prometheus)
- Working knowledge of the TCP/IP stack, Internet routing and load balancing.
- Preferred expertise with Cassandra Kafka and Elasticsearch
Feel free to apply for this position with your resume!
Michael Bailey International is acting as an Employment Business in relation to this vacancy.