Articul8

Senior Site Reliability Engineer (SRE)

Articul8

Overview

Experienced SRE responsible for ensuring reliability and performance of GenAI SaaS platform.

Ideal candidate has 5+ years of experience in SRE or DevOps with strong cloud platform skills.

remoteseniorfull-timeAWSGCPAzurePythonGoBashTerraformCloudFormationDockerKubernetesPrometheusGrafanaELK Stack

Locations

  • Brazil

Requirements

  • Bachelor's degree or equivalent experience
  • 5+ years in DevOps or SRE
  • Strong experience with cloud platforms
  • Proficiency in programming/scripting language
  • Hands-on experience with infrastructure as code tools
  • Solid background in containerization technologies
  • Proven experience with monitoring and observability tools

Responsibilities

  • Architect and maintain scalable infrastructure
  • Design and implement monitoring solutions
  • Automate deployment and management
  • Define and improve SLOs and SLIs
  • Participate in on-call rotations
  • Collaborate with development teams
  • Lead incident response efforts
  • Optimize infrastructure for performance