advanced, text

OntoNotes - Coreference Resolution

Coreference resolution annotation based on the OntoNotes 5.0 corpus (Pradhan et al., CoNLL 2012). Annotators identify coreferent mentions -- expressions that refer to the same real-world entity -- and link them into coreference chains across multi-sentence text.
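As a minimal sketch of what "linking mentions into chains" means in practice, the following merges pairwise links into chains with a union-find structure. The function name `build_chains`, the `(start, end)` character-offset representation, and the link format are illustrative assumptions, not the tool's actual storage format. The text is a shortened version of the first sample item.

```python
from collections import defaultdict


def build_chains(mentions, links):
    """Group mentions into chains from pairwise coreference links (union-find)."""
    parent = {m: m for m in mentions}

    def find(m):
        # Follow parent pointers to the representative, shortening the path as we go.
        while parent[m] != m:
            parent[m] = parent[parent[m]]
            m = parent[m]
        return m

    for a, b in links:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = defaultdict(list)
    for m in mentions:
        groups[find(m)].append(m)
    # A mention linked to nothing is a singleton, not a chain.
    return [sorted(group) for group in groups.values() if len(group) > 1]


text = ("Maria Chen joined the research team at Stanford University last "
        "September. She had previously worked at Google, where she led a team.")


def span(phrase):
    """Return the (start, end) offsets of the first occurrence of phrase."""
    start = text.index(phrase)
    return (start, start + len(phrase))


maria, she_1, she_2 = span("Maria Chen"), span("She"), span("she")
team, stanford = span("the research team"), span("Stanford University")

mentions = [maria, she_1, she_2, team, stanford]
links = [(maria, she_1), (she_1, she_2)]  # pairwise annotator decisions

chains = build_chains(mentions, links)
for chain in chains:
    print([text[s:e] for s, e in chain])
```

Because links are merged transitively, linking "Maria Chen" to "She" and "She" to "she" is enough to place all three mentions in one chain; the two unlinked noun phrases stay out of the result.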


Configuration file: config.yaml

# OntoNotes - Coreference Resolution
# Based on Pradhan et al., CoNLL 2012
# Paper: https://aclanthology.org/W12-4501/
# Dataset: https://catalog.ldc.upenn.edu/LDC2013T19
#
# Coreference resolution involves identifying all expressions (mentions)
# in a text that refer to the same entity and grouping them into chains.
#
# Mention types:
# - Proper nouns: "Barack Obama", "Microsoft"
# - Common noun phrases: "the president", "the company"
# - Pronouns: "he", "she", "it", "they"
# - Demonstratives: "this", "that"
#
# Guidelines:
# - Mark all mentions of each entity, including pronouns
# - Link mentions that refer to the same real-world entity
# - Include nested mentions when appropriate (e.g., "his" in "his company")
# - Appositives should be linked (e.g., "Obama, the president")
# - Generic mentions (e.g., "people" in general) are typically not linked

annotation_task_name: "OntoNotes: Coreference Resolution"
task_dir: "."

data_files:
  - sample-data.json

item_properties:
  id_key: "id"
  text_key: "text"

output_annotation_dir: "annotation_output/"
output_annotation_format: "json"

port: 8000
server_name: localhost

annotation_schemes:
  - annotation_type: span
    name: mention_spans
    description: "Highlight all entity mentions (noun phrases and pronouns) in the text"
    labels:
      - "Mention"
    tooltips:
      "Mention": "Any expression referring to an entity: proper nouns, common nouns, pronouns, or demonstratives"

  - annotation_type: coreference
    name: coreference_chains
    description: "Link mentions that refer to the same real-world entity into coreference chains"

annotation_instructions: |
  Annotate coreference relations in the text:
  1. First, highlight all entity mentions using the span tool (nouns, pronouns, noun phrases).
  2. Then, use the coreference tool to link mentions that refer to the same entity.
  3. Include pronouns (he, she, it, they) and demonstratives (this, that) as mentions.
  4. An entity may have many mentions across multiple sentences.
  5. Do not link generic references (e.g., "people" used generically).

html_layout: |
  <div style="padding: 15px; max-width: 800px; margin: auto;">
    <div style="background: #f0f9ff; border: 1px solid #bae6fd; border-radius: 8px; padding: 16px; margin-bottom: 16px;">
      <strong style="color: #0369a1;">Text:</strong>
      <p style="font-size: 16px; line-height: 2.0; margin: 8px 0 0 0;">{{text}}</p>
    </div>
    <div style="background: #fefce8; border: 1px solid #fde68a; border-radius: 8px; padding: 12px;">
      <p style="font-size: 13px; color: #713f12; margin: 0;"><strong>Instructions:</strong> Mark all entity mentions, then link those that refer to the same entity into coreference chains.</p>
    </div>
  </div>

allow_all_users: true
instances_per_annotator: 50
annotation_per_instance: 2
allow_skip: true
skip_reason_required: false
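Since the configuration above requests two annotations per instance, one plausible follow-up is to compare the mention spans marked by the two annotators. The sketch below computes an exact-match F1 score over sets of `(start, end)` character offsets. The function name `span_f1` and the input format are assumptions for illustration; the tool's real output would need to be converted into this shape first.

```python
def span_f1(spans_a, spans_b):
    """Exact-match F1 between two annotators' mention spans.

    Each argument is an iterable of (start, end) character offsets.
    Annotator A is treated as the reference; F1 is symmetric, so the
    choice does not affect the score.
    """
    a, b = set(spans_a), set(spans_b)
    if not a and not b:
        return 1.0  # both annotators marked nothing: perfect agreement
    overlap = len(a & b)
    if overlap == 0:
        return 0.0
    precision = overlap / len(b)
    recall = overlap / len(a)
    return 2 * precision * recall / (precision + recall)


# Hypothetical spans from two annotators on the same item.
annotator_a = [(0, 10), (75, 78), (39, 58)]
annotator_b = [(0, 10), (75, 78), (18, 35)]

score = span_f1(annotator_a, annotator_b)
print(f"mention agreement (exact-match F1): {score:.3f}")
```

Exact matching is strict: spans that differ by a single character count as disagreements. A partial-overlap criterion may suit some projects better.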

Sample data: sample-data.json

[
  {
    "id": "onto_001",
    "text": "Maria Chen joined the research team at Stanford University last September. She had previously worked at Google, where she led a team developing natural language processing models. Her new colleagues welcomed her warmly, noting that her expertise would be invaluable to the department."
  },
  {
    "id": "onto_002",
    "text": "The International Space Station orbits Earth approximately every 90 minutes. It has been continuously inhabited since November 2000. NASA and its partner agencies maintain the station, performing regular upgrades to keep it operational. The orbiting laboratory hosts experiments that cannot be conducted anywhere else."
  }
]

// ... and 8 more items
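Before starting the server, it can help to confirm that every item carries the fields named by `id_key` and `text_key` in the configuration. The check below is a sketch: `validate_items` is a hypothetical helper, not part of the tool, and the inline JSON uses shortened, deliberately flawed items rather than the real sample file.

```python
import json


def validate_items(items, id_key="id", text_key="text"):
    """Return a list of problems found in the data items (empty if none)."""
    problems = []
    seen_ids = set()
    for index, item in enumerate(items):
        # Both configured keys must be present and hold non-empty strings.
        for key in (id_key, text_key):
            value = item.get(key)
            if not isinstance(value, str) or not value.strip():
                problems.append(f"item {index}: missing or empty '{key}'")
        # IDs must be unique so annotations can be matched back to items.
        item_id = item.get(id_key)
        if isinstance(item_id, str):
            if item_id in seen_ids:
                problems.append(f"item {index}: duplicate id '{item_id}'")
            seen_ids.add(item_id)
    return problems


# Illustrative data with an empty text, a missing text, and a repeated id.
sample = json.loads("""
[
  {"id": "onto_001", "text": "Maria Chen joined the research team."},
  {"id": "onto_002", "text": ""},
  {"id": "onto_001"}
]
""")

problems = validate_items(sample)
for problem in problems:
    print(problem)
```

To check the real file, replace the inline string with `json.load(open("sample-data.json", encoding="utf-8"))`.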

Get this design

View on GitHub

Clone or download from the repository

Quick start:

git clone https://github.com/davidjurgens/potato-showcase.git
cd potato-showcase/text/coreference/ontonotes-coreference-resolution
potato start config.yaml

Details

Annotation types

coreference, span

Domain

NLP, Linguistics

Use cases

Coreference Resolution, Entity Linking, Discourse Analysis

Tags

ontonotes, coreference, mention-detection, entity-chains, conll2012
