Reka AI

Member of Technical Staff (Cluster Manager)

Reka AI

Overview

Role involves managing and optimizing compute infrastructure for performance and reliability.

Ideal candidate has experience with large-scale systems and strong automation skills.

remotefull-timePythonBashDockerKubernetesPrometheusGrafanaAWSGCPAzure

Locations

  • United States
  • United Kingdom

Requirements

  • Experience managing large-scale distributed systems
  • Strong scripting skills
  • Experience with containerization technologies

Responsibilities

  • Ensure reliability and performance of compute infrastructure
  • Design and maintain tools for system operations
  • Collaborate with teams to meet infrastructure needs