Cohere

Member of Technical Staff, MLE (Pre-Training Data)

Cohere

Overview

Machine Learning Engineer focusing on pretraining data and data pipeline development.

Ideal candidate has strong software engineering skills and experience with large-scale datasets.

remotepermanentfull-timeEnglishPythonApache Sparkpandas

Locations

  • United States, California, San Francisco
  • United States, California, London

Requirements

  • Strong software engineering skills
  • Experience with data processing frameworks
  • Knowledge of data quality assessment techniques

Responsibilities

  • Design and build scalable data pipelines
  • Conduct data ablations
  • Develop robust data modeling techniques
  • Research and implement innovative data curation methods
  • Collaborate with cross-functional teams

Benefits

  • Open and inclusive culture
  • Weekly lunch stipend
  • Full health and dental benefits
  • 100% Parental Leave top-up
  • Personal enrichment benefits
  • Remote-flexible work
  • 6 weeks of vacation