Site Reliability Engineer
Writer
Overview
Role involves developing and implementing the SRE program to ensure system reliability and performance.
Ideal candidate has 7+ years of SRE experience and strong programming skills in Python, Java, or Go.
hybridseniorpermanentfull-timeEnglishTerraformPythonJavaGoAWSAzureGCPDockerKubernetesPrometheusGrafanaELK Stack+ 5 more
Locations
United Kingdom, England, London
Requirements
7+ years experience in SRE Bachelor's degree in Computer Science or related field Proficiency in Python, Java, Go Experience with AWS, Azure, or GCP Expertise in Docker and Kubernetes Knowledge of monitoring tools like Prometheus and Grafana Ability to mentor junior engineers Excellent communication skills
Responsibilities
Lead design and maintenance of cloud infrastructure Implement scalable cloud automation Automate infrastructure provisioning Collaborate with development teams Develop monitoring and alerting systems Conduct post-mortem analyses Optimize cloud infrastructure Ensure security and compliance
Benefits
Comprehensive medical and dental insurance Fertility and family planning support Early-detection cancer testing Competitive pension scheme Annual work-life stipends