Software Engineer - AI Evals and Test

P-1 AI

Overview

Responsible for defining and validating evals for AI performance.

The ideal candidate has experience in software testing and strong Python skills.

You must have existing work authorization in the US or Canada.

remote, full-time, English, Python, git, CI/CD

Locations

Canada
United States

Requirements

Experience in constructing test suites
Experience designing evaluation metrics
Proficiency in Python programming

Responsibilities

Implement eval benchmarks
Ensure effective evals in CI/CD
Collaborate with partners and experts