LLM Response Preference
Compare AI-generated responses to collect preference data for RLHF training.

Configuration Fileconfig.yaml
annotation_task_name: "LLM Response Preference Collection"
task_dir: "."
port: 8000
# Data configuration
data_files:
- "data/response_pairs.json"
item_properties:
id_key: id
text_key: text
# Annotation schemes
annotation_schemes:
# Main preference rating
- annotation_type: pairwise
name: overall_preference
description: "Overall, which response is better?"
options:
- label: "A is much better"
value: "A++"
- label: "A is slightly better"
value: "A+"
- label: "About equal"
value: "="
- label: "B is slightly better"
value: "B+"
- label: "B is much better"
value: "B++"
sequential_key_binding: true
# Individual aspect ratings
- annotation_type: pairwise
name: helpfulness
description: "Which response is more helpful?"
options:
- label: "A"
value: "A"
- label: "Equal"
value: "="
- label: "B"
value: "B"
- annotation_type: pairwise
name: accuracy
description: "Which response is more accurate?"
options:
- label: "A"
value: "A"
- label: "Equal"
value: "="
- label: "B"
value: "B"
- annotation_type: pairwise
name: safety
description: "Which response is safer/less harmful?"
options:
- label: "A"
value: "A"
- label: "Equal"
value: "="
- label: "B"
value: "B"
# Reasons for preference
- annotation_type: multiselect
name: preference_reasons
description: "What factors influenced your choice? (Select all that apply)"
labels:
- More accurate information
- Better explained
- More concise
- More thorough
- Better formatting
- More appropriate tone
- Safer/less harmful
# Free-text justification
- annotation_type: text
name: justification
description: "Briefly explain your preference"
textarea: true
required: false
placeholder: "Why did you prefer one response over the other?"
# User settings
require_password: false
# Output
output_annotation_dir: "annotation_output/"
output_annotation_format: "json"
Get This Design
This design is available in our showcase. Copy the configuration below to get started.
Quick start:
# Create your project folder mkdir pairwise-preference cd pairwise-preference # Copy config.yaml from above potato start config.yaml
Details
Annotation Types
Domain
Use Cases
Tags
Related Designs
DPO Preference Data Collection
Pairwise preference annotation for Direct Preference Optimization, based on Rafailov et al., NeurIPS 2023. Annotators compare two model responses to a prompt, select a preference, rate alignment dimensions, and provide reasoning.
Pairwise Preference with Rationale
Compare two AI responses and select the better one while providing a written justification. Used for reward model training with interpretable preference signals.
SWE-Bench+ Patch Screening
Screen and compare model-generated patches against gold patches for SWE-Bench+ instances. Annotators evaluate correctness, identify specific issues, and compare model vs. gold solutions side-by-side.