QA over Tabular Data
A question answering task over tabular data in which annotators answer questions based on table content and classify each question's answerability. Based on SemEval-2025 Task 8.
Configuration File
config.yaml
# QA over Tabular Data
# Based on SemEval 2025 Task 8 Organizers, SemEval 2025
# Paper: https://aclanthology.org/volumes/2025.semeval-1/
# Dataset: https://github.com/SemEval/semeval-2025-task8
#
# This task asks annotators to answer questions based on tabular data,
# and to classify whether the question is answerable from the table alone,
# requires external knowledge, is ambiguous, or is unanswerable.
annotation_task_name: "QA over Tabular Data"
task_dir: "."
data_files:
  - sample-data.json
item_properties:
  id_key: "id"
  text_key: "text"
output_annotation_dir: "annotation_output/"
output_annotation_format: "json"
port: 8000
server_name: localhost
annotation_schemes:
  - annotation_type: text
    name: answer
    description: "Provide the answer to the question based on the table."
  - annotation_type: radio
    name: answerability
    description: "Can the question be answered from the table alone?"
    labels:
      - "Answerable from Table"
      - "Requires External Knowledge"
      - "Ambiguous"
      - "Unanswerable"
    keyboard_shortcuts:
      "Answerable from Table": "1"
      "Requires External Knowledge": "2"
      "Ambiguous": "3"
      "Unanswerable": "4"
    tooltips:
      "Answerable from Table": "The answer can be found directly in the table"
      "Requires External Knowledge": "Answering requires knowledge not in the table"
      "Ambiguous": "The question is unclear or could have multiple interpretations"
      "Unanswerable": "The question cannot be answered even with external knowledge"
annotation_instructions: |
  You will be shown a question and a table. Your task is to:
  1. Read the question carefully.
  2. Examine the table content provided.
  3. Provide an answer if possible.
  4. Classify whether the question is answerable from the table alone.
html_layout: |
  <div style="padding: 15px; max-width: 800px; margin: auto;">
    <div style="background: #f0f9ff; border: 1px solid #bae6fd; border-radius: 8px; padding: 16px; margin-bottom: 16px;">
      <strong style="color: #0369a1;">Question:</strong>
      <p style="font-size: 16px; line-height: 1.7; margin: 8px 0 0 0;">{{text}}</p>
    </div>
    <div style="background: #fefce8; border: 1px solid #fde68a; border-radius: 8px; padding: 16px; margin-bottom: 16px;">
      <strong style="color: #a16207;">Table: {{table_title}}</strong>
      <pre style="font-size: 14px; line-height: 1.5; margin: 8px 0 0 0; white-space: pre-wrap;">{{table_content}}</pre>
    </div>
  </div>
allow_all_users: true
instances_per_annotator: 50
annotation_per_instance: 2
allow_skip: true
skip_reason_required: false
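The html_layout above refers to three item fields through {{...}} placeholders: text, table_title and table_content. The sketch below is a minimal, hypothetical check that every placeholder has a matching key in a data item, followed by a simple substitution. It assumes plain {{name}} placeholders; the helper names (placeholders, missing_fields, render) are illustrative only and are not part of Potato, whose actual rendering code is not shown here.

```python
import html
import re

# Trimmed copy of the html_layout value from config.yaml (inline styles omitted).
HTML_LAYOUT = """\
<div>
  <strong>Question:</strong>
  <p>{{text}}</p>
  <strong>Table: {{table_title}}</strong>
  <pre>{{table_content}}</pre>
</div>
"""

# Matches {{name}} with optional spaces inside the braces.
PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def placeholders(layout: str) -> set:
    """Return the field names referenced as {{name}} in the layout."""
    return set(PLACEHOLDER.findall(layout))


def missing_fields(layout: str, item: dict) -> set:
    """Return placeholder names that the data item does not provide."""
    return placeholders(layout) - set(item)


def render(layout: str, item: dict) -> str:
    """Replace each {{name}} with the HTML-escaped item value.

    Hypothetical stand-in for the real rendering step, for illustration only.
    """
    def fill(match):
        return html.escape(str(item[match.group(1)]))

    return PLACEHOLDER.sub(fill, layout)


# Shortened version of the first sample item.
item = {
    "id": "tabqa_001",
    "text": "What is the total revenue for Q3 2024?",
    "table_title": "Quarterly Financial Report 2024",
    "table_content": "Quarter | Revenue ($M)\nQ3 2024 | 52.3",
}

print(sorted(placeholders(HTML_LAYOUT)))
print(missing_fields(HTML_LAYOUT, item))
print(render(HTML_LAYOUT, item))
```

Running a check like this over every item before starting the server may catch a misspelled key early, since an item without table_title or table_content would leave part of the layout empty or unfilled.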
Sample Data
sample-data.json
[
  {
    "id": "tabqa_001",
    "text": "What is the total revenue for Q3 2024?",
    "table_content": "Quarter | Revenue ($M) | Expenses ($M) | Profit ($M)\nQ1 2024 | 45.2 | 32.1 | 13.1\nQ2 2024 | 48.7 | 33.5 | 15.2\nQ3 2024 | 52.3 | 35.8 | 16.5\nQ4 2024 | 55.1 | 37.2 | 17.9",
    "table_title": "Quarterly Financial Report 2024"
  },
  {
    "id": "tabqa_002",
    "text": "Which country has the highest life expectancy?",
    "table_content": "Country | Life Expectancy | Population (M) | GDP per Capita ($)\nJapan | 84.6 | 125.7 | 42,940\nSwitzerland | 83.8 | 8.7 | 87,097\nAustralia | 83.4 | 25.9 | 55,057\nCanada | 82.4 | 38.2 | 46,195",
    "table_title": "Country Health and Economic Indicators"
  }
]
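Each table_content value is a single string with rows separated by newlines and cells separated by " | ". The following sketch, which assumes that layout holds for every item, splits such a string into row dictionaries and looks up the answers to the two sample questions. The helper names parse_table and to_number are hypothetical and exist only for this illustration.

```python
def parse_table(table_content: str) -> list:
    """Split a pipe-delimited table string into one dict per data row."""
    lines = table_content.split("\n")
    header = [cell.strip() for cell in lines[0].split("|")]
    rows = []
    for line in lines[1:]:
        cells = [cell.strip() for cell in line.split("|")]
        rows.append(dict(zip(header, cells)))
    return rows


def to_number(cell: str) -> float:
    """Convert a cell such as '42,940' or '84.6' to a float."""
    return float(cell.replace(",", ""))


# table_content values copied from the two sample items.
finance_table = (
    "Quarter | Revenue ($M) | Expenses ($M) | Profit ($M)\n"
    "Q1 2024 | 45.2 | 32.1 | 13.1\n"
    "Q2 2024 | 48.7 | 33.5 | 15.2\n"
    "Q3 2024 | 52.3 | 35.8 | 16.5\n"
    "Q4 2024 | 55.1 | 37.2 | 17.9"
)
life_table = (
    "Country | Life Expectancy | Population (M) | GDP per Capita ($)\n"
    "Japan | 84.6 | 125.7 | 42,940\n"
    "Switzerland | 83.8 | 8.7 | 87,097\n"
    "Australia | 83.4 | 25.9 | 55,057\n"
    "Canada | 82.4 | 38.2 | 46,195"
)

# tabqa_001: "What is the total revenue for Q3 2024?"
finance_rows = parse_table(finance_table)
q3 = next(row for row in finance_rows if row["Quarter"] == "Q3 2024")
print(q3["Revenue ($M)"])  # 52.3

# tabqa_002: "Which country has the highest life expectancy?"
life_rows = parse_table(life_table)
best = max(life_rows, key=lambda row: to_number(row["Life Expectancy"]))
print(best["Country"])  # Japan
```

Both sample questions can be resolved by a direct lookup in the table, so under the scheme above they would plausibly fall under "Answerable from Table".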
(sample-data.json contains 8 more items, not shown here.)
Get This Design
Clone or download from the repository
Quick start:
git clone https://github.com/davidjurgens/potato-showcase.git
cd potato-showcase/semeval/2025/task08-tabular-qa
potato start config.yaml
Related Designs
BRAINTEASER - Commonsense-Defying QA
A lateral-thinking question answering task in which annotators select answers to brain teasers that defy default commonsense assumptions and provide explanations. Based on SemEval-2024 Task 9 (BRAINTEASER).
Math Question Answering and Category Classification
Mathematical question answering with category classification, covering algebra, geometry, number theory, and statistics. Based on SemEval-2019 Task 10 (Math QA).
R2VQ - Recipe Question Answering
Question answering over recipe texts, where annotators answer questions about recipes and classify whether the answer is explicitly stated, inferable, or not answerable from the recipe. Based on SemEval-2022 Task 9 (Jia et al.).