Platform Engineer, MLOps
Writer
Overview
Role focused on deploying and managing infrastructure for AI/ML operations.
Ideal candidate has 5+ years of experience building core infrastructure and operating orchestration systems at scale.
hybridfull-timeEnglishDockerKubernetesTerraformPythonBashGITGitHubPrometheusGrafana
Locations
United Kingdom, England, London
Requirements
Professional experience with model training Experience with Huggingface Transformers Experience with infrastructure as code tools like Terraform Experience with cloud platforms like Google Cloud, AWS or Azure
Responsibilities
Design and deploy CI/CD pipeline Manage monitoring, logging, and alerting systems Ensure training environments are available Develop containerization and orchestration systems Operate large Kubernetes clusters Improve software solutions reliability Measure and optimize system performance Provide operational support for software applications
Benefits
Comprehensive medical and dental insurance Fertility and family planning support Early-detection cancer testing Competitive pension scheme Annual work-life stipends