Coreference Resolution (OntoNotes)
Link pronouns and noun phrases to the entities they refer to in text. Based on the OntoNotes coreference annotation guidelines and CoNLL shared tasks. Identify mention spans and cluster coreferent mentions together.
text annotation
Configuration Fileconfig.yaml
# Coreference Resolution Configuration
# Link pronouns and mentions to their referents
annotation_task_name: "Coreference Resolution"
data_files:
- "data/documents.json"
item_properties:
id_key: "id"
text_display_key: "display"
user_config:
allow_all_users: true
annotation_schemes:
- annotation_type: "radio"
name: "coreference"
description: "Does the highlighted mention refer to the same entity as the target?"
labels:
- name: "Same Entity"
tooltip: "The mention refers to the same entity"
key_value: "y"
color: "#22c55e"
- name: "Different Entity"
tooltip: "The mention refers to a different entity"
key_value: "n"
color: "#ef4444"
- name: "Ambiguous"
tooltip: "Cannot determine from context"
key_value: "a"
color: "#eab308"
- annotation_type: "radio"
name: "mention_type"
description: "What type of mention is this?"
labels:
- name: "Pronoun"
tooltip: "he, she, it, they, etc."
- name: "Proper noun"
tooltip: "Names of people, places, organizations"
- name: "Common noun"
tooltip: "the company, the scientist, etc."
- name: "Demonstrative"
tooltip: "this, that, these, those"
- annotation_type: "text"
name: "antecedent"
description: "What is the full antecedent (what does this refer to)?"
- annotation_type: "likert"
name: "difficulty"
description: "How difficult was this decision?"
size: 5
min_label: "Very easy"
max_label: "Very difficult"
output: "annotation_output/"
Sample Datasample-data.json
[
{
"id": "coref_001",
"text": "Sarah told Emily that she would help with the project tomorrow.",
"target_entity": "Sarah",
"mention": "she",
"mention_position": [
24,
27
],
"display": "**Text:** Sarah told Emily that **[she]** would help with the project tomorrow.\n\n**Target Entity:** Sarah\n**Mention to evaluate:** she\n\n*Question: Does 'she' refer to 'Sarah'?*"
},
{
"id": "coref_002",
"text": "The company announced record profits. It plans to expand into new markets next year.",
"target_entity": "The company",
"mention": "It",
"mention_position": [
37,
39
],
"display": "**Text:** The company announced record profits. **[It]** plans to expand into new markets next year.\n\n**Target Entity:** The company\n**Mention to evaluate:** It\n\n*Question: Does 'It' refer to 'The company'?*"
}
]
// ... and 2 more itemsGet This Design
Clone or download from the repository
Quick start:
git clone https://github.com/davidjurgens/potato-showcase.git cd potato-showcase/coreference-resolution potato start config.yaml
Details
Annotation Types
Domain
Use Cases
Tags
Found an issue or want to improve this design?
Open an IssueRelated Designs
Dialogue Relation Extraction (DialogRE)
Extract relations between entities in dialogue. Based on Yu et al., ACL 2020. Identify 36 relation types between speakers and entities mentioned in conversations.
Emotion Cause Extraction (RECCON)
Extract emotion causes from conversational text based on RECCON (Poria et al., EMNLP 2020). Identify which utterances and specific spans caused an emotion expressed in dialogue.
Event Argument Extraction (MAVEN-Arg)
Document-level event argument extraction based on MAVEN-Arg (Wang et al., ACL 2024). Annotates event triggers with their argument roles including Agent, Patient, Location, Time, Instrument, and more. Supports both entity and non-entity arguments across document context.