Skip to content
Esta página aún no está disponible en su idioma. Se muestra la versión en inglés.

Bildannotation

Bilder mit Klassifikation, Bounding Boxes und Regionsmarkierung annotieren.

Bildannotation

Potato unterstützt Bildannotation für Klassifikations-, Objekterkennungs- und Regionsmarkierungsaufgaben.

Bildanzeige aktivieren

yaml
image:
  enabled: true
  max_width: 800
  max_height: 600

Datenformat

Bilder in Ihren Daten referenzieren:

json
{
  "id": "img_1",
  "image_path": "images/photo_001.jpg",
  "description": "Optional description"
}

Das Bildfeld konfigurieren:

yaml
data_files:
  - path: data/image_tasks.json
    image_field: image_path

Bildklassifikation

Gesamte Bilder klassifizieren:

yaml
annotation_schemes:
  - annotation_type: radio
    name: category
    description: "What is shown in this image?"
    labels:
      - Cat
      - Dog
      - Bird
      - Other
 
  - annotation_type: multiselect
    name: attributes
    description: "Select all that apply"
    labels:
      - Indoor
      - Outdoor
      - Multiple animals
      - Human present

Multi-Label-Klassifikation

yaml
annotation_schemes:
  - annotation_type: multiselect
    name: objects
    description: "What objects are visible?"
    labels:
      - Person
      - Car
      - Building
      - Tree
      - Animal
      - Furniture
      - Food
      - Electronic device

Bildqualitätsbewertung

yaml
annotation_schemes:
  - annotation_type: likert
    name: quality
    description: "Overall image quality"
    size: 5
    min_label: "Very poor"
    max_label: "Excellent"
 
  - annotation_type: multiselect
    name: issues
    description: "Select any quality issues"
    labels:
      - Blurry
      - Overexposed
      - Underexposed
      - Noisy
      - Low resolution
      - Watermark visible

Bounding-Box-Annotation

Boxen um Objekte zeichnen:

yaml
annotation_schemes:
  - annotation_type: bbox
    name: objects
    description: "Draw boxes around objects"
    labels:
      - Person
      - Car
      - Bicycle
      - Traffic sign
    label_colors:
      Person: "#3b82f6"
      Car: "#10b981"
      Bicycle: "#f59e0b"
      "Traffic sign": "#ef4444"

Bounding-Box-Ausgabe

json
{
  "id": "img_1",
  "objects": [
    {
      "label": "Person",
      "x": 100,
      "y": 50,
      "width": 80,
      "height": 200
    },
    {
      "label": "Car",
      "x": 300,
      "y": 150,
      "width": 150,
      "height": 100
    }
  ]
}

Vorgeladene Bounding Boxes

Vorhandene Annotationen zur Überprüfung laden:

json
{
  "id": "img_1",
  "image_path": "images/photo_001.jpg",
  "predictions": [
    {"label": "Person", "x": 100, "y": 50, "width": 80, "height": 200, "confidence": 0.95}
  ]
}
yaml
annotation_schemes:
  - annotation_type: bbox
    name: objects
    load_predictions: true
    prediction_field: predictions

Regions-/Polygon-Annotation

Für nicht-rechteckige Regionen:

yaml
annotation_schemes:
  - annotation_type: polygon
    name: regions
    description: "Outline regions of interest"
    labels:
      - Building
      - Road
      - Vegetation
      - Water

Bildvergleich

Zwei Bilder vergleichen:

yaml
data_files:
  - path: data/image_pairs.json
    item_a_field: image_original
    item_b_field: image_edited
 
annotation_schemes:
  - annotation_type: pairwise
    name: preference
    description: "Which image looks better?"
    options:
      - label: "Original"
        value: "A"
      - label: "Edited"
        value: "B"
      - label: "Same"
        value: "tie"

Bildbeschriftung

yaml
annotation_schemes:
  - annotation_type: text
    name: caption
    description: "Write a caption for this image"
    textarea: true
    placeholder: "Describe what you see..."
    min_length: 10
    max_length: 300

Bildunterschrift-Qualitätsprüfung

yaml
data_files:
  - path: data/captions.json
    image_field: image_path
    text_field: generated_caption
 
annotation_schemes:
  - annotation_type: likert
    name: accuracy
    description: "How accurate is this caption?"
    size: 5
    min_label: "Very inaccurate"
    max_label: "Very accurate"
 
  - annotation_type: likert
    name: fluency
    description: "How natural is the language?"
    size: 5
    min_label: "Very awkward"
    max_label: "Very natural"
 
  - annotation_type: text
    name: improved_caption
    description: "Suggest a better caption (optional)"
    textarea: true

Anzeigeoptionen

Bildgröße

yaml
image:
  max_width: 800
  max_height: 600
  preserve_aspect_ratio: true

Zoom-Steuerung

yaml
image:
  zoom_enabled: true
  initial_zoom: fit  # 'fit', 'actual', or percentage

Vollbildmodus

yaml
image:
  fullscreen_enabled: true

Inhaltsmoderation

yaml
annotation_schemes:
  - annotation_type: radio
    name: safe_for_work
    description: "Is this image safe for work?"
    labels:
      - Safe
      - Questionable
      - Not Safe
 
  - annotation_type: multiselect
    name: violation_types
    description: "Select all violations (if any)"
    labels:
      - Violence
      - Adult content
      - Hate symbols
      - Graphic content
      - Spam/advertisement
    show_if:
      scheme: safe_for_work
      value: ["Questionable", "Not Safe"]

Unterstützte Formate

Gängige unterstützte Bildformate:

  • JPEG/JPG
  • PNG
  • GIF
  • WebP
  • BMP
yaml
image:
  allowed_formats: ["jpg", "jpeg", "png", "webp"]

Vollständiges Beispiel: Objekterkennungsüberprüfung

yaml
task_name: "Object Detection Verification"
 
image:
  enabled: true
  max_width: 1000
  zoom_enabled: true
 
data_files:
  - path: data/detections.json
    image_field: image_path
 
annotation_schemes:
  # Review pre-loaded predictions
  - annotation_type: bbox
    name: objects
    description: "Verify and correct object boxes"
    labels:
      - Person
      - Vehicle
      - Animal
      - Object
    load_predictions: true
    prediction_field: model_predictions
    label_colors:
      Person: "#3b82f6"
      Vehicle: "#10b981"
      Animal: "#f59e0b"
      Object: "#6b7280"
 
  # Overall assessment
  - annotation_type: radio
    name: prediction_quality
    description: "How accurate were the predictions?"
    labels:
      - All correct
      - Minor corrections needed
      - Major corrections needed
      - Mostly incorrect
 
  - annotation_type: number
    name: missed_objects
    description: "How many objects were missed?"
    min: 0
    max: 50
 
  - annotation_type: text
    name: notes
    description: "Any issues or comments?"
    textarea: true
    required: false

Leistungstipps

  1. Bildgröße optimieren – Große Bilder vor der Annotation verkleinern
  2. JPEG für Fotos verwenden – Kleinere Dateigrößen, schnelleres Laden
  3. PNG für Grafiken verwenden – Bessere Qualität für Diagramme/Screenshots
  4. Lazy Loading aktivieren – Für große Datensätze
  5. Vorschaubilder erwägen – Vorschauen in Listenansichten anzeigen
  6. Einheitlich vorverarbeiten – Größen und Formate normalisieren