Deepgram

Platform Engineer -- AI/ML Infrastructure

Deepgram

Overview

Expert Platform Engineer to build and operate AI/ML infrastructure.

Ideal candidate has 5+ years in Platform Engineering with strong Kubernetes and Terraform skills.

160k usd / yearremoteseniorpermanentfull-timeKubernetesTerraformAWSCI/CDPythonGobash

Locations

  • United States

Requirements

  • 5+ years experience in Platform Engineering
  • Hands-on experience with Terraform
  • Expert knowledge of Kubernetes
  • Experience with HPC job schedulers like Slurm
  • Experience managing bare metal infrastructure
  • Strong scripting skills in Python, Go, Bash

Responsibilities

  • Architect and maintain core computing platform
  • Develop and manage infrastructure with IaC
  • Design and optimize AI/ML job scheduling
  • Provision and maintain bare metal infrastructure
  • Implement networking and storage solutions
  • Develop observability stack
  • Collaborate with AI researchers
  • Automate deployment life cycle