Software Engineer - AI Evals and Test
P-1 AI
Overview
Responsible for defining and validating evals for AI performance.
Ideal candidate has experience in software testing and strong Python skills.
You must have existing work authorization in the US or Canada
remotefull-timeEnglishPythongitCI/CD
Locations
Requirements
Experience in constructing test suites Experience designing evaluation metrics Proficiency in Python programming
Responsibilities
Implement eval benchmarks Ensure effective evals in CI/CD Collaborate with partners and experts