KG-BERT Knowledge Graph Triple Validation
Validate knowledge graph triples for correctness and annotate relation types based on the KG-BERT framework. Annotators assess whether entity-relation-entity triples are valid, classify the relation type, and provide entity descriptions.
Configuration file: config.yaml
# KG-BERT Knowledge Graph Triple Validation
# Based on Yao et al., AAAI 2020
# Paper: https://arxiv.org/abs/1909.03193
# Dataset: https://github.com/yao8839836/kg-bert
#
# Validate knowledge graph triples and annotate relation types.
# Each item presents a subject-relation-object triple. Annotators
# assess whether the triple is valid, classify the relation type,
# and provide entity descriptions.
#
# Triple Validity:
# - Valid Triple: The relationship is factually correct
# - Invalid Triple: The relationship is factually incorrect
# - Uncertain: Cannot determine validity with available knowledge
#
# Relation Types:
# hypernym_of, part_of, located_in, instance_of, member_of,
# has_property, cause_of, used_for
#
# Annotation Guidelines:
# 1. Read the triple description and individual entities
# 2. Assess whether the stated relationship holds
# 3. Select the most appropriate relation type
# 4. Provide a brief description of the head entity
annotation_task_name: "KG-BERT Knowledge Graph Triple Validation"
task_dir: "."
data_files:
- sample-data.json
item_properties:
  id_key: "id"
  text_key: "text"
output_annotation_dir: "annotation_output/"
output_annotation_format: "json"
port: 8000
server_name: localhost
annotation_schemes:
  - annotation_type: radio
    name: triple_validity
    description: "Is this knowledge graph triple valid?"
    labels:
      - "Valid Triple"
      - "Invalid Triple"
      - "Uncertain"
    keyboard_shortcuts:
      "Valid Triple": "1"
      "Invalid Triple": "2"
      "Uncertain": "3"
    tooltips:
      "Valid Triple": "The stated relationship between the entities is factually correct"
      "Invalid Triple": "The stated relationship is factually incorrect or nonsensical"
      "Uncertain": "Cannot determine the validity with reasonable confidence"
  - annotation_type: select
    name: relation_type
    description: "What type of relation best describes this triple?"
    labels:
      - "hypernym_of"
      - "part_of"
      - "located_in"
      - "instance_of"
      - "member_of"
      - "has_property"
      - "cause_of"
      - "used_for"
    tooltips:
      "hypernym_of": "The head entity is a broader category of the tail entity"
      "part_of": "The head entity is a component or part of the tail entity"
      "located_in": "The head entity is geographically or physically located in the tail entity"
      "instance_of": "The head entity is a specific instance of the tail entity class"
      "member_of": "The head entity belongs to or is a member of the tail entity group"
      "has_property": "The head entity possesses the property described by the tail entity"
      "cause_of": "The head entity causes or leads to the tail entity"
      "used_for": "The head entity is used for the purpose described by the tail entity"
  - annotation_type: text
    name: entity_description
    description: "Provide a brief description of the head entity (1-2 sentences)."
annotation_instructions: |
  You will validate knowledge graph triples from the KG-BERT framework.
  For each item:
  1. Read the triple: head entity -> relation -> tail entity.
  2. Assess whether the relationship is valid (factually correct).
  3. Select the relation type that best categorizes the relationship.
  4. Write a brief description of the head entity.
  Example judgments:
  - (Paris, located_in, France) -- Valid
  - (Dog, hypernym_of, Animal) -- Invalid (it should be Animal hypernym_of Dog)
  - (Wheel, part_of, Car) -- Valid
  When uncertain, rely on common knowledge and widely accepted facts.
html_layout: |
  <div style="padding: 15px; max-width: 800px; margin: auto;">
    <div style="background: #fefce8; border: 1px solid #fde68a; border-radius: 8px; padding: 16px; margin-bottom: 16px;">
      <strong style="color: #a16207;">Triple Description:</strong>
      <p style="font-size: 16px; line-height: 1.7; margin: 8px 0 0 0;">{{text}}</p>
    </div>
    <div style="display: grid; grid-template-columns: 1fr 1fr 1fr; gap: 10px; margin-bottom: 16px;">
      <div style="background: #f0fdf4; border: 1px solid #86efac; border-radius: 8px; padding: 12px; text-align: center;">
        <strong style="color: #166534;">Head Entity</strong>
        <p style="font-size: 15px; margin: 6px 0 0 0;">{{head_entity}}</p>
      </div>
      <div style="background: #f5f3ff; border: 1px solid #c4b5fd; border-radius: 8px; padding: 12px; text-align: center;">
        <strong style="color: #6d28d9;">Relation</strong>
        <p style="font-size: 15px; margin: 6px 0 0 0;">{{relation}}</p>
      </div>
      <div style="background: #f0f9ff; border: 1px solid #bae6fd; border-radius: 8px; padding: 12px; text-align: center;">
        <strong style="color: #0369a1;">Tail Entity</strong>
        <p style="font-size: 15px; margin: 6px 0 0 0;">{{tail_entity}}</p>
      </div>
    </div>
  </div>
allow_all_users: true
instances_per_annotator: 50
annotation_per_instance: 2
allow_skip: true
skip_reason_required: false
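Before launching, it can help to sanity-check that every item in the data file carries the fields the config depends on: `id` and `text` from `item_properties`, plus the `head_entity`, `relation`, and `tail_entity` placeholders used in `html_layout`. A minimal sketch (the helper name `check_items` is ours, not part of Potato):

```python
# Fields referenced by item_properties and the html_layout placeholders.
REQUIRED_FIELDS = {"id", "text", "head_entity", "relation", "tail_entity"}

def check_items(items):
    """Return (item_id, missing_fields) pairs for items lacking required fields."""
    problems = []
    for i, item in enumerate(items):
        missing = REQUIRED_FIELDS - item.keys()
        if missing:
            problems.append((item.get("id", f"index {i}"), sorted(missing)))
    return problems
```

To run it against the actual file, load it first, e.g. `check_items(json.load(open("sample-data.json")))`; an empty result means every item is complete.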
Sample data: sample-data.json
[
{
"id": "kg_001",
"text": "Paris is located in France.",
"head_entity": "Paris",
"relation": "located_in",
"tail_entity": "France"
},
{
"id": "kg_002",
"text": "A wheel is part of a bicycle.",
"head_entity": "Wheel",
"relation": "part_of",
"tail_entity": "Bicycle"
}
]
// ... and 8 more items

Get this design
Clone or download from the repository
Quick start:

git clone https://github.com/davidjurgens/potato-showcase.git
cd potato-showcase/text/information-extraction/kgbert-knowledge-graph
potato start config.yaml
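Since the config sets `annotation_per_instance: 2`, each triple is judged by two annotators, so a simple quality check is the fraction of instances where both `triple_validity` labels match. The exact on-disk schema under `annotation_output/` varies by Potato version, so this sketch assumes you have already flattened the results into `(instance_id, annotator, label)` rows:

```python
from collections import defaultdict

def pairwise_agreement(rows):
    """rows: iterable of (instance_id, annotator, label) tuples.

    Returns the fraction of doubly-annotated instances whose two
    labels agree; instances with fewer than two labels are skipped.
    """
    by_instance = defaultdict(list)
    for instance_id, _annotator, label in rows:
        by_instance[instance_id].append(label)
    pairs = [labels for labels in by_instance.values() if len(labels) >= 2]
    if not pairs:
        return 0.0
    agree = sum(1 for labels in pairs if labels[0] == labels[1])
    return agree / len(pairs)
```

For a publishable reliability figure you would typically go beyond raw agreement to a chance-corrected statistic such as Cohen's kappa, but raw agreement is a quick first signal during piloting.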
Found a problem or want to improve this design? Submit an issue.

Related designs
Named Entity Disambiguation (AIDA-CoNLL)
Named entity disambiguation and linking to Wikidata knowledge base based on the AIDA-CoNLL dataset. Annotators identify named entity mentions in news text, classify them by type (PER, ORG, LOC, MISC), and link them to their corresponding Wikidata entities using QIDs, handling ambiguous references and NIL entities.
Argument Reasoning in Civil Procedure
Legal argument reasoning task requiring annotators to answer multiple-choice questions about civil procedure by selecting the best answer and providing legal reasoning. Based on SemEval-2024 Task 5.
BRAINTEASER - Commonsense-Defying QA
Lateral thinking and commonsense-defying question answering task requiring annotators to select answers to brain teasers that defy default commonsense assumptions and provide explanations. Based on SemEval-2024 Task 9 (BRAINTEASER).