Member of Technical Staff (Cluster Manager)
Reka AI
Overview
Role involves managing and optimizing compute infrastructure for performance and reliability.
Ideal candidate has experience with large-scale systems and strong automation skills.
remotefull-timePythonBashDockerKubernetesPrometheusGrafanaAWSGCPAzure
Locations
Requirements
Experience managing large-scale distributed systems Experience with containerization technologies
Responsibilities
Ensure reliability and performance of compute infrastructure Design and maintain tools for system operations Collaborate with teams to meet infrastructure needs