Liquid AI

Member of Technical Staff - Foundational Model Data

Liquid AI

Overview

Role focused on consolidating and generating high-quality text data for foundation model development.

Ideal candidate should have expertise in dataset engineering and strong programming skills in Python.

hybridmidpermanentfull-timeEnglishPythonML frameworks

Locations

  • United States, Massachusetts, Boston
  • United States, California, San Francisco

Requirements

  • B.S. + 5 years experience or M.S. + 3 years experience or Ph.D. + 1 year experience
  • Expertise in dataset engineering
  • Strong programming skills in Python

Responsibilities

  • Create and maintain data cleaning and selection pipeline
  • Gather datasets from the web
  • Write and maintain synthetic data generation pipelines
  • Run ablations to assess new datasets