← All roles Data & AI

Human Baseliner for Open-Ended ML Research Tasks

$75 to $90/hr

## Overview

We are hiring experienced machine learning engineers and researchers to serve as **human baseliners** for evaluations of open-ended machine learning research tasks. These evaluations measure how well AI agents perform on realistic AI R&D problems. To interpret agent performance, we also need strong human reference points: skilled practitioners attempting the same tasks under the same time and compute constraints. As a baseliner, you will complete self-contained ML research tasks in a sandboxed environment, working independently with your preferred tools and workflow. Your performance will be used as a benchmark against which frontier-model agents are evaluated.

## What You’ll Do

## Commitment

## Requirements

Candidates must meet **all** of the following:

## Required Domain Expertise

Candidates must have strong practical experience in **at least one** of the following:

## Logistics (work trial requirements)

Apply for this role

How it works: apply here and we connect you to our hiring partner for this role. By continuing you agree we may forward your application.