FAVA: Fine-grained Hallucination Annotations for Faithful Generation

Fine-grained hallucination span annotation. Annotators identify hallucinated spans in LLM output and classify hallucination types (entity error, relation error, contradicted, invented, subjective, unverifiable). Based on the FAVA framework for fine-grained faithfulness evaluation.

Configuration Fileconfig.yaml

This Potato config reproduces the annotation task. Save it as config.yaml and run potato start config.yaml to try it.

yaml

# FAVA: Fine-grained Hallucination Annotations for Faithful Generation
# Based on Mishra et al., ACL 2024
# Paper: https://aclanthology.org/2024.acl-long.664/
# Dataset: https://github.com/fava-nlp/fava
#
# This task annotates fine-grained hallucination spans in LLM-generated text.
# Annotators compare generated text against a source and identify spans that
# contain hallucinations, classifying each by type.
#
# Hallucination Types:
# - Entity Error: Wrong entity mentioned (e.g., wrong person, place, or organization)
# - Relation Error: Entities are correct but the relation between them is wrong
# - Contradicted: Information directly contradicts the source text
# - Invented: Information not present in the source (fabricated details)
# - Subjective: Subjective claims presented as facts
# - Unverifiable: Claims that cannot be verified from the given source
#
# Annotation Guidelines:
# 1. Read the source text carefully to understand the ground truth
# 2. Read the generated text and compare against the source
# 3. Highlight each hallucinated span precisely (mark only the incorrect part)
# 4. Classify each span with the most specific hallucination type
# 5. Entity Error: The generated text mentions a wrong entity
# 6. Relation Error: Entities exist in source but their relationship is wrong
# 7. Contradicted: Generated text says the opposite of the source
# 8. Invented: Generated text includes details absent from the source
# 9. Subjective: Opinions or value judgments not in the source
# 10. Unverifiable: Cannot determine truth from the source alone

annotation_task_name: "FAVA: Hallucination Span Annotation"
task_dir: "."

data_files:
  - sample-data.json
item_properties:
  id_key: "id"
  text_key: "text"

output_annotation_dir: "annotation_output/"
output_annotation_format: "json"

annotation_schemes:
  # Step 1: Identify hallucinated spans and classify type
  - annotation_type: span
    name: hallucination_spans
    description: "Highlight spans in the generated text that contain hallucinations and select the hallucination type"
    labels:
      - "Entity Error"
      - "Relation Error"
      - "Contradicted"
      - "Invented"
      - "Subjective"
      - "Unverifiable"
    label_colors:
      "Entity Error": "#ef4444"
      "Relation Error": "#f97316"
      "Contradicted": "#dc2626"
      "Invented": "#8b5cf6"
      "Subjective": "#f59e0b"
      "Unverifiable": "#6b7280"
    keyboard_shortcuts:
      "Entity Error": "1"
      "Relation Error": "2"
      "Contradicted": "3"
      "Invented": "4"
      "Subjective": "5"
      "Unverifiable": "6"
    tooltips:
      "Entity Error": "A wrong entity is mentioned (wrong name, place, date, number, etc.)"
      "Relation Error": "Entities are correct but the relationship between them is wrong"
      "Contradicted": "The information directly contradicts what the source says"
      "Invented": "The information is fabricated and not present anywhere in the source"
      "Subjective": "A subjective opinion or value judgment presented as objective fact"
      "Unverifiable": "The claim cannot be verified or refuted using the given source"
    allow_overlapping: false

  # Step 2: Overall faithfulness rating
  - annotation_type: radio
    name: overall_faithfulness
    description: "Overall, how faithful is the generated text to the source?"
    labels:
      - "Fully Faithful"
      - "Mostly Faithful"
      - "Partially Faithful"
      - "Mostly Unfaithful"
      - "Completely Unfaithful"
    keyboard_shortcuts:
      "Fully Faithful": "a"
      "Mostly Faithful": "b"
      "Partially Faithful": "c"
      "Mostly Unfaithful": "d"
      "Completely Unfaithful": "f"
    tooltips:
      "Fully Faithful": "No hallucinations detected; all information matches the source"
      "Mostly Faithful": "Minor hallucinations that do not change the overall meaning"
      "Partially Faithful": "Some significant hallucinations mixed with faithful content"
      "Mostly Unfaithful": "Most of the generated text contains hallucinated information"
      "Completely Unfaithful": "The generated text is almost entirely hallucinated"

html_layout: |
  <div style="margin-bottom: 10px; padding: 10px; background: #f0fdf4; border-left: 4px solid #22c55e; border-radius: 4px;">
    <strong>Source Text:</strong><br>{{source_text}}
  </div>
  <div style="margin-bottom: 10px; padding: 10px; background: #fef2f2; border-left: 4px solid #ef4444; border-radius: 4px;">
    <strong>Generated Text (annotate hallucinations below):</strong><br>{{text}}
  </div>

allow_all_users: true
instances_per_annotator: 40
annotation_per_instance: 2
allow_skip: true
skip_reason_required: false

Sample Datasample-data.json

json

[
  {
    "id": "fava_001",
    "text": "The Eiffel Tower was built in 1889 for the World's Fair in Paris. Designed by Alexandre Gustave Eiffel, it stands 324 meters tall and was originally intended as a permanent monument to French engineering. It is the most visited paid monument in the world, attracting over 7 million visitors annually.",
    "source_text": "The Eiffel Tower is a wrought-iron lattice tower on the Champ de Mars in Paris. It was constructed from 1887 to 1889 as the centerpiece of the 1889 World's Fair. Named after engineer Gustave Eiffel, whose company designed and built the tower, it stands 330 metres tall. The tower was originally intended to be dismantled after 20 years but was saved because of its usefulness as a radio transmission tower. It is the most visited paid monument in the world, with 6.91 million visitors in 2015."
  },
  {
    "id": "fava_002",
    "text": "The Great Wall of China was primarily built during the Ming Dynasty between 1368 and 1644. It stretches approximately 13,000 miles across northern China. The wall was constructed to protect against invasions from Mongol tribes. It is visible from space with the naked eye and is one of the most impressive architectural feats in human history.",
    "source_text": "The Great Wall of China is a series of fortifications built along the historical northern borders of China. While walls were built as early as the 7th century BC, the most well-known sections were built during the Ming Dynasty (1368-1644). The total length of all sections ever built is approximately 13,171 miles (21,196 km). The wall was built primarily to protect Chinese states and empires against various nomadic groups. Contrary to popular belief, the Great Wall is not visible from space with the naked eye under normal conditions, according to astronauts."
  }
]

// ... and 8 more items

Get This Design

View on GitHub

Clone or download from the repository

Quick start:

git clone https://github.com/davidjurgens/potato-showcase.git
cd potato-showcase/text/fact-verification/fava-hallucination-spans
potato start config.yaml

Dataset & paper

Mishra et al., ACL 2024

Official dataset ↗Read the paper ↗

Citation (BibTeX)

bibtex

@inproceedings{mishra-etal-2024-fava,
    title = "{FAVA}: Fine-grained Hallucination Annotations for Faithful Generation",
    author = "Mishra, Abhika  and Celikyilmaz, Asli  and Thickstun, John  and Hashimoto, Tatsunori  and Iyer, Shankar",
    booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/2401.06855",
    pages = "12318--12349"
}

Details

Annotation Types

spanradio

Domain

NLPHallucination DetectionFaithfulness

Use Cases

Hallucination DetectionFaithfulness EvaluationLLM Output Assessment

Related Designs

Code Review Annotation (CodeReviewer)

Annotation of code review activities based on the CodeReviewer benchmark. Annotators identify issues in code diffs, classify defect types, assign severity levels, make review decisions, and provide natural language review comments, supporting research in automated code review and software engineering.

spanradio

EA-MT - Entity-Aware Machine Translation

Entity-aware machine translation evaluation requiring annotators to identify entity spans, classify translation errors, and provide corrected translations. Based on SemEval-2025 Task 2.

spanradio

MathDial: Math Tutoring Dialogue Dataset

MathDial is a dataset of 2,861 one-to-one math tutoring dialogues grounded in GSM8K word problems, annotated with a four-category teacher-move taxonomy. This Potato config reproduces those teacher-move labels.

radiomultiselect

FAVA: Fine-grained Hallucination Annotations for Faithful Generation

Configuration Fileconfig.yaml

Sample Datasample-data.json

Get This Design

Dataset & paper

Details

Annotation Types

Domain

Use Cases

Tags

Related Designs

Code Review Annotation (CodeReviewer)

EA-MT - Entity-Aware Machine Translation

MathDial: Math Tutoring Dialogue Dataset