ClickUp

Staff Site Reliability Engineer

ClickUp

Overview

Role focused on improving stability and reliability of cloud-based infrastructure.

Ideal candidate should have 4-6+ years of AWS experience and a strong operational focus.

remotemidEnglishAWSKubernetesDockerTerraformGitHub ActionsArgoCDDatadogCloudWatchPostgreSQL

Locations

  • Poland

Requirements

  • 4-6+ years AWS experience
  • Kubernetes experience
  • Production-critical infrastructure management experience
  • Familiarity with SRE best practices
  • Experience with IaC and CI/CD
  • Knowledge of network and security best practices
  • Experience with monitoring tools
  • Linux-based EC2 management experience

Responsibilities

  • Design and build systems for performance and reliability
  • Collaborate with engineering teams
  • Increase stability and observability
  • Champion monitoring infrastructure
  • Implement site reliability improvements
  • Respond to downtime events
  • Participate in brainstorming sessions