Overview
Role responsible for ensuring system reliability, availability, and scalability through automation and performance optimization.
Ideal candidate should have 4+ years of experience in Site Reliability Engineering or related fields with strong cloud platform knowledge.
hybridmidpermanentfull-timeEnglishAWSAzureGCPTerraformDockerKubernetesBashPythonGoPrometheusGrafanaDatadog+ 5 more
Locations
Requirements
4+ years experience in SRE, DevOps, or System Engineering Strong knowledge of cloud platforms Experience with observability and monitoring tools
Responsibilities
Ensure reliability, availability, and scalability of systems Design and implement scalable systems Develop and maintain observability tools Automate infrastructure provisioning Optimize system performance Conduct root cause analysis
Benefits
Top-tier Health Insurance Public Transportation Pass Air Conference 2025 in Las Vegas