Multilingual Emoji Prediction
Prediction of the most likely emoji for a given tweet, choosing from the 20 most frequent emojis. Based on SemEval-2018 Task 2 (Multilingual Emoji Prediction).
ملف الإعدادconfig.yaml
# Multilingual Emoji Prediction
# Based on Barbieri et al., SemEval 2018
# Paper: https://aclanthology.org/S18-1003/
# Dataset: https://competitions.codalab.org/competitions/17344
#
# This task asks annotators to predict the most appropriate emoji for a
# given tweet from a set of 20 most frequent emojis on Twitter.
#
# The 20 emoji categories represent the most commonly used emojis,
# described by their textual names for accessibility.
annotation_task_name: "Multilingual Emoji Prediction"
task_dir: "."
data_files:
- sample-data.json
item_properties:
id_key: "id"
text_key: "text"
output_annotation_dir: "annotation_output/"
output_annotation_format: "json"
port: 8000
server_name: localhost
annotation_schemes:
- annotation_type: radio
name: emoji_prediction
description: "Which emoji best matches this tweet?"
labels:
- "Heart"
- "Face Laughing"
- "Heart Eyes"
- "Fire"
- "Smiling"
- "Crying"
- "Two Hearts"
- "Kiss"
- "Folded Hands"
- "Clapping"
- "OK"
- "Sparkles"
- "Flexed Biceps"
- "Camera"
- "Sunglasses"
- "Blue Heart"
- "Musical Notes"
- "Smirking"
- "Sun"
- "Purple Heart"
tooltips:
"Heart": "Red heart - love, affection, strong positive emotion"
"Face Laughing": "Laughing face with tears - something very funny"
"Heart Eyes": "Heart-eyes face - adoration, infatuation"
"Fire": "Fire - something hot, trending, excellent"
"Smiling": "Smiling face - general happiness or friendliness"
"Crying": "Crying face - sadness or being deeply moved"
"Two Hearts": "Two hearts - mutual love, romance"
"Kiss": "Kiss mark - sending a kiss, affection"
"Folded Hands": "Folded hands - please, thank you, prayer"
"Clapping": "Clapping hands - applause, congratulations"
"OK": "OK hand - approval, agreement"
"Sparkles": "Sparkles - something special, magical, or new"
"Flexed Biceps": "Flexed biceps - strength, effort, working hard"
"Camera": "Camera - photography, capturing moments"
"Sunglasses": "Sunglasses face - cool, confident"
"Blue Heart": "Blue heart - trust, loyalty, platonic love"
"Musical Notes": "Musical notes - music, singing, rhythm"
"Smirking": "Smirking face - sly, suggestive, flirty"
"Sun": "Sun - sunny day, brightness, warmth"
"Purple Heart": "Purple heart - compassion, support, honor"
annotation_instructions: |
You will be shown a tweet with the emoji removed. Your task is to predict
which of the 20 most common emojis was originally used with this tweet.
Consider the tone, topic, and sentiment of the tweet.
html_layout: |
<div style="padding: 15px; max-width: 800px; margin: auto;">
<div style="background: #f0f9ff; border: 1px solid #bae6fd; border-radius: 8px; padding: 16px; margin-bottom: 16px;">
<strong style="color: #0369a1;">Tweet:</strong>
<p style="font-size: 16px; line-height: 1.7; margin: 8px 0 0 0;">{{text}}</p>
</div>
</div>
allow_all_users: true
instances_per_annotator: 50
annotation_per_instance: 2
allow_skip: true
skip_reason_required: false
بيانات نموذجيةsample-data.json
[
{
"id": "emoji_001",
"text": "I love you so much babe, you mean the world to me"
},
{
"id": "emoji_002",
"text": "When your friend falls off the chair in the middle of class and everyone stares"
}
]
// ... and 8 more itemsاحصل على هذا التصميم
Clone or download from the repository
بدء سريع:
git clone https://github.com/davidjurgens/potato-showcase.git cd potato-showcase/semeval/2018/task02-emoji-prediction potato start config.yaml
التفاصيل
أنواع التوسيم
المجال
حالات الاستخدام
الوسوم
وجدت مشكلة أو تريد تحسين هذا التصميم؟
افتح مشكلةتصاميم ذات صلة
AfriSenti - African Language Sentiment
Sentiment analysis for tweets in African languages, classifying text as positive, negative, or neutral. Covers 14 African languages including Amharic, Hausa, Igbo, Yoruba, and Swahili. Based on SemEval-2023 Task 12 (Muhammad et al.).
Detecting Stance in Tweets
Classification of stance expressed in tweets toward specific targets as favor, against, or neither. Based on SemEval-2016 Task 6 (Stance Detection).
Explainable Online Sexism Detection
Detection and fine-grained classification of online sexism with span-level evidence extraction. Categories include threats, derogation, animosity, and prejudiced discussion. Based on SemEval-2023 Task 10 (Kirk et al.).