Showcase/Environmental Sound Classification
beginneraudio

Environmental Sound Classification

Classify environmental sounds into categories following UrbanSound8K and ESC-50 datasets.

🎧

audio annotation

Configuration Fileconfig.yaml

task_name: "Environmental Sound Classification"

# Server configuration
server:
  port: 8000

# Audio settings
audio:
  enabled: true
  display: waveform
  waveform_color: "#10B981"
  progress_color: "#34D399"
  speed_control: true

# Data configuration
data_files:
  - path: data/environmental_sounds.json
    audio_field: audio_file

# Annotation schemes
annotation_schemes:
  # Primary sound category (UrbanSound8K classes)
  - annotation_type: radio
    name: sound_category
    description: "Select the primary sound category"
    labels:
      - Air conditioner
      - Car horn
      - Children playing
      - Dog bark
      - Drilling
      - Engine idling
      - Gun shot
      - Jackhammer
      - Siren
      - Street music
    keyboard_shortcuts:
      "Air conditioner": "1"
      "Car horn": "2"
      "Children playing": "3"
      "Dog bark": "4"
      "Drilling": "5"
      "Engine idling": "6"
      "Gun shot": "7"
      "Jackhammer": "8"
      "Siren": "9"
      "Street music": "0"

  # Sound prominence
  - annotation_type: radio
    name: prominence
    description: "How prominent is the target sound?"
    labels:
      - Dominant (clearly the main sound)
      - Present (audible but not dominant)
      - Background (barely audible)
      - Absent (wrong label)

  # Audio quality
  - annotation_type: radio
    name: audio_quality
    description: "Rate the recording quality"
    labels:
      - Clear (easy to identify)
      - Moderate (some noise)
      - Poor (difficult to identify)
      - Unusable (cannot determine)

  # Confidence
  - annotation_type: likert
    name: confidence
    description: "How confident are you in your classification?"
    size: 5
    min_label: "Guessing"
    max_label: "Certain"

# User settings
allow_all_users: true
instances_per_annotator: 300

# Output
output:
  path: annotations/
  format: json

Get This Design

This design is available in our showcase. Copy the configuration below to get started.

Quick start:

# Create your project folder
mkdir environmental-sound-classification
cd environmental-sound-classification
# Copy config.yaml from above
potato start config.yaml

Details

Annotation Types

radio

Domain

Audio

Use Cases

sound classificationenvironmental audiourban sound analysis

Tags

audioenvironmentalurbansoundesc-50