Biology (PhD)

11 March 2026 Hourly Remote English
Mercor
$73.29 per hour

Fluent Language Skills Required: English

Why This Role Exists

Mercor partners with leading AI teams to improve the quality, usefulness, and reliability of general-purpose conversational AI systems. These systems are used across a wide range of everyday and professional scenarios, and their effectiveness depends on how clearly, accurately, and helpfully they respond to real user questions.

Life sciences AI must accurately reflect complex biological systems, experimental reasoning, and evolving scientific understanding. This project focuses on improving how models reason about and explain biological concepts across molecular, organismal, and systems-level topics.

What You’ll Do

  • Write and refine prompts to guide model behavior in the life sciences context
  • Evaluate LLM-generated responses to biology-related queries for scientific accuracy and reasoning quality
  • Conduct fact-checking using authoritative public sources and domain knowledge
  • Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies
  • Assess clarity, structure, and appropriateness of explanations for different audiences
  • Ensure model responses align with expected conversational behavior and system guidelines
  • Apply consistent evaluation standards by following clear taxonomies, benchmarks, and detailed evaluation guidelines

Who You Are

  • You hold a PhD in Biology or a closely related life sciences field
  • You have deep expertise in one or more of the following sub-domains:
      • Molecular & Cellular Biology
      • Organismal Physiology & Development
      • Microbiology, Immunology & Pathobiology
      • Ecology, Evolution & Environmental Biology
  • You have significant experience using large language models (LLMs) and understand how and why people use them
  • You have excellent writing skills and can clearly explain complex life sciences concepts
  • You have strong attention to detail and consistently notice subtle issues others may overlook
  • You have experience reviewing or editing technical or academic writing

Nice-to-Have Specialties

  • Prior experience with RLHF, model evaluation, or data annotation work
  • Experience teaching, mentoring, or explaining life sciences concepts to non-expert audiences
  • Familiarity with evaluation rubrics, benchmarks, or structured review frameworks

What Success Looks Like

  • You identify inaccuracies and weak mechanistic explanations in model responses to life sciences queries
  • Your feedback improves the rigor, clarity, and correctness of AI explanations
  • You deliver consistent, reproducible evaluation artifacts that strengthen model performance
  • Mercor customers trust their AI systems in life sciences and biology contexts because you’ve rigorously evaluated them

Why Join Mercor

This role allows life sciences PhDs to apply their expertise to the development of high-quality AI systems, influencing how biology is explained and understood at scale.

Compensation

  • Pay: $73.29/hour
  • Type: Hourly contract
  • Location: Remote

46 people hired recently for this role. 55 slots remaining.
