Wikimedia Foundation

Staff Site Reliability Engineer

Wikimedia Foundation

Overview

Role focused on designing and maintaining ML infrastructure for Wikimedia.

Ideal candidate has 7+ years in SRE with expertise in ML systems and infrastructure.

Only candidates from specified countries are considered

129k usd / yearremoteseniorEnglishKubernetesDockerTerraformAnsibleHelmPrometheusGrafanaELK Stack

Locations

  • Singapore
  • United States
  • Egypt
  • Costa Rica
  • Greece
  • Netherlands
  • Sweden
  • Ireland
  • Brazil
  • Poland
  • France
  • Nigeria
  • Croatia
  • Colombia
  • Uruguay
  • United Kingdom
  • Ghana
  • Kenya
  • Switzerland
  • India
  • Spain
  • Canada
  • Czech Republic
  • Finland
  • Denmark
  • Mexico
  • Italy
  • South Africa
  • Uganda
  • Israel
  • Australia
  • Peru
  • Germany
  • Estonia
  • Indonesia

Requirements

  • 7+ years in SRE or DevOps
  • Expertise in ML infrastructure
  • Strong English communication skills

Responsibilities

  • Design and implement ML infrastructure
  • Improve reliability and scalability
  • Collaborate with teams
  • Monitor and optimize system performance
  • Provide guidance and documentation
  • Mentor team members