Principal Site Reliability Engineer

Groupon

Overview

This role focuses on ensuring the performance, availability, and resilience of mission-critical systems.

The ideal candidate has 10+ years of experience in systems engineering, with expertise in cloud platforms and container orchestration.

Remote, Senior, English, Terraform, Ansible, Kubernetes, Docker, Prometheus, Grafana, ELK Stack, Python, Go, Bash

Locations

  • Peru

Requirements

  • 10+ years in systems engineering
  • 5+ years in SRE or DevOps
  • Expertise in cloud platforms
  • Proficiency in programming languages such as Python, Go, and Bash
  • Advanced knowledge of IaC tools such as Terraform and Ansible

Responsibilities

  • Architect and maintain fault-tolerant systems
  • Drive automation in infrastructure management
  • Create and optimize CI/CD pipelines
  • Build observability solutions
  • Lead incident response
  • Design performance testing strategies
  • Mentor junior engineers
  • Guide architectural decisions

Benefits

  • Cutting-edge technologies
  • Collaborative work culture
  • Professional growth opportunities
  • Impactful work