// ROLE SUMMARY

Senior RLHF Evaluator

ClosedRLHF$65–75/hrRemote(EU)Posted February 13, 2026

// DESCRIPTION

We need evaluators who can read AI-generated text critically and make consistent quality judgments under detailed rubrics. On a given day you might compare two explanations of quantum mechanics, two pieces of marketing copy, and two responses to a sensitive personal question. The common thread is careful reading, rubric application, and clear written justifications. Speed matters, but not at the expense of thoughtfulness.

Ideal candidates have a graduate-level education or equivalent professional experience in a field that requires careful argumentation and evidence evaluation. We have had especially strong results from people with backgrounds in academic research, technical writing, legal analysis, and scientific peer review. Familiarity with LLM failure modes -- hallucination, sycophancy, refusal errors -- is valuable.

Onboarding takes about one week and includes rubric training, practice tasks with feedback, and a calibration exam. After onboarding, you work asynchronously on your own schedule. A Slack workspace provides real-time access to project leads and fellow annotators for guideline questions.

// SKILLS & REQUIREMENTS

Experience with RLHF or preference labeling pipelinesComfort evaluating content across diverse subject areasGood judgment on safety and sensitivity issuesGraduate-level education or equivalent professional experienceBackground in linguistics, philosophy, law, or STEM

This role isn't open right now. Apply generally and we'll match you when projects fit your background.

Apply generally

// FREQUENTLY ASKED QUESTIONS

// RELATED POSITIONS

More RLHF roles

RLHF

// READY TO GET STARTED?

Apply in minutes

Create your profile, select your areas of expertise, and start working on frontier AI projects.

Apply generally

Senior RLHF Evaluator

// DESCRIPTION

// SKILLS & REQUIREMENTS

// FREQUENTLY ASKED QUESTIONS

More RLHF roles

Human Feedback Annotator

RLHF Preference Annotator

LLM Alignment Evaluator

Conversational AI Rater

Apply in minutes