Tweet Intimacy Analysis

Rating the level of intimacy expressed in tweets on a 5-point scale, covering multilingual social media content. Based on SemEval-2023 Task 9 (Pei et al.).

Configuration Fileconfig.yaml

# Tweet Intimacy Analysis
# Based on Pei et al., SemEval 2023
# Paper: https://aclanthology.org/2023.semeval-1.309/
# Dataset: https://github.com/LittleYUYU/MINT
#
# This task asks annotators to rate the level of intimacy expressed in
# tweets on a 5-point Likert scale. Intimacy refers to how personal,
# private, or emotionally close the content of the tweet is.
#
# Scale:
# 1 - Not Intimate: Public, impersonal content (news, facts)
# 2 - Slightly Intimate: Mildly personal opinions or preferences
# 3 - Moderately Intimate: Personal experiences or feelings shared broadly
# 4 - Intimate: Private feelings, personal relationships, or vulnerabilities
# 5 - Very Intimate: Deeply personal disclosures, confessions, or private matters

annotation_task_name: "Tweet Intimacy Analysis"
task_dir: "."

data_files:
  - sample-data.json

item_properties:
  id_key: "id"
  text_key: "text"

output_annotation_dir: "annotation_output/"
output_annotation_format: "json"

port: 8000
server_name: localhost

annotation_schemes:
  - annotation_type: likert
    name: intimacy_rating
    description: "How intimate is the content of this tweet?"
    min_label: "Not Intimate"
    max_label: "Very Intimate"
    size: 5

annotation_instructions: |
  You will see a tweet and its language. Rate the level of intimacy expressed in the tweet
  on a scale from 1 (Not Intimate) to 5 (Very Intimate).
  - 1: Public, impersonal content (news headlines, factual statements)
  - 2: Slightly personal opinions or general preferences
  - 3: Moderately personal experiences or feelings shared broadly
  - 4: Private feelings, personal relationships, or vulnerabilities
  - 5: Deeply personal disclosures, confessions, or private matters

html_layout: |
  <div style="padding: 15px; max-width: 800px; margin: auto;">
    <div style="background: #faf5ff; border: 1px solid #e9d5ff; border-radius: 8px; padding: 12px; margin-bottom: 12px;">
      <strong style="color: #7e22ce;">Language:</strong>
      <span style="font-size: 15px; margin-left: 8px;">{{language}}</span>
    </div>
    <div style="background: #f0f9ff; border: 1px solid #bae6fd; border-radius: 8px; padding: 16px; margin-bottom: 16px;">
      <strong style="color: #0369a1;">Tweet:</strong>
      <p style="font-size: 16px; line-height: 1.7; margin: 8px 0 0 0;">{{text}}</p>
    </div>
  </div>

allow_all_users: true
instances_per_annotator: 50
annotation_per_instance: 3
allow_skip: true
skip_reason_required: false

Sample Datasample-data.json

[
  {
    "id": "intimacy_001",
    "text": "Breaking: The Federal Reserve has announced a 0.25% interest rate hike, marking the fourth consecutive increase this year.",
    "language": "English"
  },
  {
    "id": "intimacy_002",
    "text": "I can't stop thinking about what happened between us last night. I don't know if I'll ever feel the same way about anyone again.",
    "language": "English"
  }
]

// ... and 8 more items

Get This Design

View on GitHub

Clone or download from the repository

Quick start:

git clone https://github.com/davidjurgens/potato-showcase.git
cd potato-showcase/semeval/2023/task09-tweet-intimacy
potato start config.yaml

Details

Annotation Types

likert

Domain

NLPSocial MediaSemEval

Use Cases

Intimacy DetectionSocial Media AnalysisMultilingual NLP

Related Designs

Assessing Humor in Edited News Headlines

Rate the funniness of edited news headlines and compare humor between original and edited versions, based on SemEval-2020 Task 7 (Hossain et al.). Headlines are minimally edited by replacing a single word to create humorous effect.

likertradio

Clickbait Detection (Webis Clickbait Corpus)

Classify headlines and social media posts as clickbait or non-clickbait based on the Webis Clickbait Corpus. Identify manipulative content designed to attract clicks through sensationalism, curiosity gaps, or misleading framing.

likertmultiselect

Determining Sentiment Intensity of English and Arabic Phrases

Fine-grained sentiment intensity scoring of text phrases on an 11-point scale from most negative to most positive. Based on SemEval-2016 Task 7.