Character Identification on Multiparty Dialogues
Identification and linking of character mentions in TV show dialogue, combining span annotation with entity resolution for the main cast of Friends. Based on SemEval-2018 Task 4.
Configuration Fileconfig.yaml
# Character Identification on Multiparty Dialogues
# Based on Chen et al., SemEval 2018
# Paper: https://aclanthology.org/S18-1007/
# Dataset: https://github.com/emorynlp/character-identification
#
# This task asks annotators to identify character mentions in TV show
# dialogue from Friends. Annotators highlight character mention spans
# and link them to the correct character entity.
#
# Span Labels:
# - Character Mention: A reference to a character (name, pronoun, description)
#
# Character Labels:
# - Monica, Rachel, Ross, Chandler, Joey, Phoebe, Other
annotation_task_name: "Character Identification on Multiparty Dialogues"
task_dir: "."
data_files:
- sample-data.json
item_properties:
id_key: "id"
text_key: "text"
output_annotation_dir: "annotation_output/"
output_annotation_format: "json"
port: 8000
server_name: localhost
annotation_schemes:
- annotation_type: span
name: character_mentions
description: "Highlight all character mentions (names, pronouns, descriptions referring to characters)."
labels:
- "Character Mention"
- annotation_type: radio
name: character_identity
description: "Which character does the primary mention refer to?"
labels:
- "Monica"
- "Rachel"
- "Ross"
- "Chandler"
- "Joey"
- "Phoebe"
- "Other"
keyboard_shortcuts:
"Monica": "1"
"Rachel": "2"
"Ross": "3"
"Chandler": "4"
"Joey": "5"
"Phoebe": "6"
"Other": "7"
tooltips:
"Monica": "Monica Geller - chef, organized, competitive"
"Rachel": "Rachel Green - fashion enthusiast, waitress turned executive"
"Ross": "Ross Geller - paleontologist, Monica's brother"
"Chandler": "Chandler Bing - statistical analysis, sarcastic humor"
"Joey": "Joey Tribbiani - actor, food lover"
"Phoebe": "Phoebe Buffay - masseuse, musician, quirky"
"Other": "A character not in the main six cast members"
annotation_instructions: |
You will be shown a dialogue excerpt from the TV show Friends along with a scene
description. Your task is to:
1. Highlight all mentions of characters in the dialogue (names, pronouns, descriptions).
2. Select which main character the primary mention refers to.
html_layout: |
<div style="padding: 15px; max-width: 800px; margin: auto;">
<div style="background: #fefce8; border: 1px solid #fde68a; border-radius: 8px; padding: 12px; margin-bottom: 12px;">
<strong style="color: #a16207;">Scene Description:</strong>
<p style="font-size: 14px; line-height: 1.5; margin: 6px 0 0 0;">{{scene_description}}</p>
</div>
<div style="background: #f0f9ff; border: 1px solid #bae6fd; border-radius: 8px; padding: 16px; margin-bottom: 16px;">
<strong style="color: #0369a1;">Dialogue:</strong>
<p style="font-size: 16px; line-height: 1.7; margin: 8px 0 0 0;">{{text}}</p>
</div>
</div>
allow_all_users: true
instances_per_annotator: 50
annotation_per_instance: 2
allow_skip: true
skip_reason_required: false
Sample Datasample-data.json
[
{
"id": "charid_001",
"text": "Monica: I can't believe you ate my last piece of cake! Joey: It was just sitting there! She didn't even put her name on it!",
"scene_description": "Monica's apartment, kitchen area. Monica is standing by the refrigerator looking upset."
},
{
"id": "charid_002",
"text": "Ross: So I told her about the fossil discovery. Rachel: And she was interested? Ross: She fell asleep. Chandler: Shocking.",
"scene_description": "Central Perk. The group is sitting on the couch."
}
]
// ... and 8 more itemsGet This Design
Clone or download from the repository
Quick start:
git clone https://github.com/davidjurgens/potato-showcase.git cd potato-showcase/semeval/2018/task04-character-identification potato start config.yaml
Details
Annotation Types
Domain
Use Cases
Tags
Found an issue or want to improve this design?
Open an IssueRelated Designs
Emotion Discovery and Reasoning its Flip
Emotion discovery and flip reasoning task requiring annotators to identify the current emotion in a dialogue turn, and locate trigger spans and flip points where emotions change. Based on SemEval-2024 Task 10.
Aspect-Based Sentiment Analysis
Identification of aspect terms in review text with sentiment polarity classification for each aspect. Based on SemEval-2016 Task 5 (ABSA).
Causal Medical Claim Detection and PICO Extraction
Detection of causal claims in medical texts and extraction of PICO (Population, Intervention, Comparator, Outcome) elements. Based on SemEval-2023 Task 8 (Khetan et al.).