Skip to content
Showcase/OntoNotes - Coreference Resolution
advancedtext

OntoNotes - Coreference Resolution

Coreference resolution annotation based on the OntoNotes 5.0 corpus (Pradhan et al., CoNLL 2012). Annotators identify coreferent mentions -- expressions that refer to the same real-world entity -- and link them into coreference chains across multi-sentence text.

Chains:Chain 1 (3)Chain 2 (2)Click mentions to group into chains

File di configurazioneconfig.yaml

# OntoNotes - Coreference Resolution
# Based on Pradhan et al., CoNLL 2012
# Paper: https://aclanthology.org/W12-4501/
# Dataset: https://catalog.ldc.upenn.edu/LDC2013T19
#
# Coreference resolution involves identifying all expressions (mentions)
# in a text that refer to the same entity and grouping them into chains.
#
# Mention types:
# - Proper nouns: "Barack Obama", "Microsoft"
# - Common noun phrases: "the president", "the company"
# - Pronouns: "he", "she", "it", "they"
# - Demonstratives: "this", "that"
#
# Guidelines:
# - Mark all mentions of each entity, including pronouns
# - Link mentions that refer to the same real-world entity
# - Include nested mentions when appropriate (e.g., "his" in "his company")
# - Appositives should be linked (e.g., "Obama, the president")
# - Generic mentions (e.g., "people" in general) are typically not linked

annotation_task_name: "OntoNotes: Coreference Resolution"
task_dir: "."

data_files:
  - sample-data.json

item_properties:
  id_key: "id"
  text_key: "text"

output_annotation_dir: "annotation_output/"
output_annotation_format: "json"

port: 8000
server_name: localhost

annotation_schemes:
  - annotation_type: span
    name: mention_spans
    description: "Highlight all entity mentions (noun phrases and pronouns) in the text"
    labels:
      - "Mention"
    tooltips:
      "Mention": "Any expression referring to an entity: proper nouns, common nouns, pronouns, or demonstratives"

  - annotation_type: coreference
    name: coreference_chains
    description: "Link mentions that refer to the same real-world entity into coreference chains"

annotation_instructions: |
  Annotate coreference relations in the text:
  1. First, highlight all entity mentions using the span tool (nouns, pronouns, noun phrases).
  2. Then, use the coreference tool to link mentions that refer to the same entity.
  3. Include pronouns (he, she, it, they) and demonstratives (this, that) as mentions.
  4. An entity may have many mentions across multiple sentences.
  5. Do not link generic references (e.g., "people" used generically).

html_layout: |
  <div style="padding: 15px; max-width: 800px; margin: auto;">
    <div style="background: #f0f9ff; border: 1px solid #bae6fd; border-radius: 8px; padding: 16px; margin-bottom: 16px;">
      <strong style="color: #0369a1;">Text:</strong>
      <p style="font-size: 16px; line-height: 2.0; margin: 8px 0 0 0;">{{text}}</p>
    </div>
    <div style="background: #fefce8; border: 1px solid #fde68a; border-radius: 8px; padding: 12px;">
      <p style="font-size: 13px; color: #713f12; margin: 0;"><strong>Instructions:</strong> Mark all entity mentions, then link those that refer to the same entity into coreference chains.</p>
    </div>
  </div>

allow_all_users: true
instances_per_annotator: 50
annotation_per_instance: 2
allow_skip: true
skip_reason_required: false

Dati di esempiosample-data.json

[
  {
    "id": "onto_001",
    "text": "Maria Chen joined the research team at Stanford University last September. She had previously worked at Google, where she led a team developing natural language processing models. Her new colleagues welcomed her warmly, noting that her expertise would be invaluable to the department."
  },
  {
    "id": "onto_002",
    "text": "The International Space Station orbits Earth approximately every 90 minutes. It has been continuously inhabited since November 2000. NASA and its partner agencies maintain the station, performing regular upgrades to keep it operational. The orbiting laboratory hosts experiments that cannot be conducted anywhere else."
  }
]

// ... and 8 more items

Ottieni questo design

View on GitHub

Clone or download from the repository

Avvio rapido:

git clone https://github.com/davidjurgens/potato-showcase.git
cd potato-showcase/text/coreference/ontonotes-coreference-resolution
potato start config.yaml

Dettagli

Tipi di annotazione

coreferencespan

Dominio

NLPLinguistics

Casi d'uso

Coreference ResolutionEntity LinkingDiscourse Analysis

Tag

ontonotescoreferencemention-detectionentity-chainsconll2012

Hai trovato un problema o vuoi migliorare questo design?

Apri un problema