Principal Site Reliability Engineer
Groupon
Overview
Role focused on ensuring performance, availability, and resilience of mission-critical systems.
Ideal candidate should have 10+ years in systems engineering with expertise in cloud platforms and container orchestration.
remoteseniorEnglishTerraformAnsibleKubernetesDockerPrometheusGrafanaELK StackPythonGoBash+ 1 more
Locations
Requirements
10+ years in systems engineering 5+ years in SRE or DevOps Expertise in cloud platforms Proficiency in programming languages Advanced knowledge of IaC tools
Responsibilities
Architect and maintain fault-tolerant systems Drive automation in infrastructure management Create and optimize CI/CD pipelines Build observability solutions Design performance testing strategies Guide architectural decisions
Benefits
Cutting-edge technologies Collaborative work culture Professional growth opportunities