P-1 AI

Software Engineer - AI Evals and Test

Overview

Responsible for defining and validating evals for AI performance.

The ideal candidate has experience in software testing and strong Python skills.

You must have existing work authorization in the US or Canada.

remote, full-time, English, Python, git, CI/CD

Locations

  • Canada
  • United States

Requirements

  • Experience in constructing test suites
  • Experience designing evaluation metrics
  • Proficiency in Python programming

Responsibilities

  • Implement eval benchmarks
  • Ensure effective evals in CI/CD
  • Collaborate with partners and experts