Overview
The role focuses on ensuring the reliability, scalability, and performance of critical systems.
The ideal candidate has 5+ years of experience in Site Reliability Engineering and strong Kubernetes skills.
hybrid, senior, permanent, full-time, English, Kubernetes, Prometheus, ELK, Puppet, Ansible, Linux, DNS
Locations
Israel, Tel Aviv, Tel Aviv
Requirements
- 5+ years experience in SRE or similar
- Deep understanding of SRE principles
- Extensive experience with Kubernetes
- Strong experience with monitoring tools
- Proficiency in configuration management tools
- Solid understanding of Linux and networking
- Strong programming skills in Python and/or Go
- Experience with managing core services
Responsibilities
- Ensure reliability and performance of infrastructure
- Manage Kubernetes infrastructure
- Design and maintain monitoring stack
- Automate provisioning and deployment
- Troubleshoot complex infrastructure issues
- Participate in on-call rotations
- Develop infrastructure-as-code
- Collaborate with development teams
Benefits