Skip to content
Showcase/LLM Response Preference
intermediatepreference

LLM Response Preference

Compare AI-generated responses to collect preference data for RLHF training.

LLM Response Preference annotation interface preview

Configuration Fileconfig.yaml

yaml
annotation_task_name: "LLM Response Preference Collection"
task_dir: "."

port: 8000

# Data configuration
data_files:
  - "data/response_pairs.json"

item_properties:
  id_key: id
  text_key: text

# Annotation schemes
annotation_schemes:
  # Main preference rating
  - annotation_type: pairwise
    name: overall_preference
    description: "Overall, which response is better?"
    options:
      - label: "A is much better"
        value: "A++"
      - label: "A is slightly better"
        value: "A+"
      - label: "About equal"
        value: "="
      - label: "B is slightly better"
        value: "B+"
      - label: "B is much better"
        value: "B++"
    sequential_key_binding: true

  # Individual aspect ratings
  - annotation_type: pairwise
    name: helpfulness
    description: "Which response is more helpful?"
    options:
      - label: "A"
        value: "A"
      - label: "Equal"
        value: "="
      - label: "B"
        value: "B"

  - annotation_type: pairwise
    name: accuracy
    description: "Which response is more accurate?"
    options:
      - label: "A"
        value: "A"
      - label: "Equal"
        value: "="
      - label: "B"
        value: "B"

  - annotation_type: pairwise
    name: safety
    description: "Which response is safer/less harmful?"
    options:
      - label: "A"
        value: "A"
      - label: "Equal"
        value: "="
      - label: "B"
        value: "B"

  # Reasons for preference
  - annotation_type: multiselect
    name: preference_reasons
    description: "What factors influenced your choice? (Select all that apply)"
    labels:
      - More accurate information
      - Better explained
      - More concise
      - More thorough
      - Better formatting
      - More appropriate tone
      - Safer/less harmful

  # Free-text justification
  - annotation_type: text
    name: justification
    description: "Briefly explain your preference"
    textarea: true
    required: false
    placeholder: "Why did you prefer one response over the other?"

# User settings
require_password: false

# Output
output_annotation_dir: "annotation_output/"
output_annotation_format: "json"

Get This Design

This design is available in our showcase. Copy the configuration below to get started.

Quick start:

# Create your project folder
mkdir pairwise-preference
cd pairwise-preference
# Copy config.yaml from above
potato start config.yaml

Details

Annotation Types

pairwisemultiselecttext

Domain

NLPAI/ML

Use Cases

RLHFpreference learningmodel evaluation

Tags

llmpreferencerlhfcomparison