Member of Technical Staff - Foundational Model Data
Liquid AI
Overview
Role focused on consolidating and generating high-quality text data for foundation model development.
Ideal candidate should have expertise in dataset engineering and strong programming skills in Python.
hybridmidpermanentfull-timeEnglishPythonML frameworks
Locations
United States, Massachusetts, Boston United States, California, San Francisco
Requirements
B.S. + 5 years experience or M.S. + 3 years experience or Ph.D. + 1 year experience Expertise in dataset engineering Strong programming skills in Python
Responsibilities
Create and maintain data cleaning and selection pipeline Gather datasets from the web Write and maintain synthetic data generation pipelines Run ablations to assess new datasets