Wikimedia Foundation

Staff Site Reliability Engineer

Wikimedia Foundation

Overview

Role focused on designing and maintaining ML infrastructure for Wikimedia.

Ideal candidate has 7+ years in SRE with strong ML infrastructure experience.

Only candidates from specified countries are considered

129k usd / yearremoteseniorEnglishKubernetesDockerTerraformAnsibleHelmPrometheusGrafanaELK Stack

Locations

  • United States, California

Requirements

  • 7+ years in SRE or DevOps
  • Expertise in ML infrastructure
  • Proficiency in automation tools

Responsibilities

  • Design and implement ML infrastructure
  • Improve reliability and scalability
  • Collaborate with teams
  • Monitor and optimize performance
  • Provide guidance and documentation
  • Mentor team members