Skip to content
Tutorials5 min read

Anotación de Clasificación de Géneros Musicales

Crea una tarea de anotación musical con visualización de forma de onda, vistas previas de 30 segundos y etiquetas jerárquicas de género.

Potato Team·

Anotación de Clasificación de Géneros Musicales

La clasificación de géneros musicales alimenta los sistemas de recomendación, la generación de listas de reproducción y el descubrimiento de música. Este tutorial muestra cómo construir interfaces de anotación para el etiquetado de géneros con vistas previas de audio y taxonomías jerárquicas.

Clasificación Básica de Géneros

yaml
annotation_task_name: "Music Genre Classification"
 
data_files:
  - data/tracks.json
 
item_properties:
  audio_path: audio_preview
  metadata_fields: [artist, title, album]
 
# Show track metadata
display:
  show_metadata: true
  metadata_template: "{{artist}} - {{title}}"
 
annotation_schemes:
  - annotation_type: audio_annotation
    audio_display: waveform
    waveform_color: "#EC4899"
    progress_color: "#F472B6"
    speed_control: true
    show_duration: true
    name: primary_genre
    description: "Select the primary genre"
    labels:
      - Rock
      - Pop
      - Hip-Hop
      - Electronic
      - Jazz
      - Classical
      - R&B
      - Country
      - Metal
      - Folk

Taxonomía Jerárquica de Géneros

yaml
annotation_schemes:
  # Top-level genre
  - annotation_type: radio
    name: genre
    description: "Primary genre"
    labels:
      - Rock
      - Electronic
      - Hip-Hop
      - Pop
      - Jazz
      - Classical
 
  # Subgenre (conditional)
  - annotation_type: radio
    name: subgenre
    description: "Subgenre"
    conditional:
      depends_on: genre
      options:
        Rock:
          - Classic Rock
          - Alternative
          - Indie Rock
          - Hard Rock
          - Punk
          - Grunge
        Electronic:
          - House
          - Techno
          - Trance
          - Drum & Bass
          - Ambient
          - Dubstep
        "Hip-Hop":
          - East Coast
          - West Coast
          - Trap
          - Conscious
          - Lo-fi
        Pop:
          - Synth Pop
          - Dance Pop
          - Indie Pop
          - K-Pop
          - Teen Pop
        Jazz:
          - Bebop
          - Smooth Jazz
          - Fusion
          - Big Band
          - Cool Jazz
        Classical:
          - Baroque
          - Romantic
          - Contemporary
          - Opera
          - Chamber

Etiquetado Multi-Etiqueta de Géneros

La música a menudo abarca múltiples géneros:

yaml
annotation_schemes:
  - annotation_type: multiselect
    name: genres
    description: "Select ALL applicable genres (up to 3)"
    labels:
      - Rock
      - Pop
      - Electronic
      - Hip-Hop
      - R&B
      - Jazz
      - Classical
      - Country
      - Folk
      - Metal
      - Punk
      - Indie
      - World
      - Latin
    min_selections: 1
    max_selections: 3
 
  - annotation_type: radio
    name: primary_genre
    description: "Which is the PRIMARY genre?"
    dynamic_from: genres  # Only show selected genres as options

Configuración Completa de Anotación Musical

yaml
annotation_task_name: "Music Tagging and Classification"
 
data_files:
  - data/music_library.json
 
item_properties:
  audio_path: preview_url
  metadata_fields:
    - artist
    - title
    - album
    - year
    - duration
 
display:
  show_metadata: true
  metadata_layout: card
  fields:
    - label: "Artist"
      field: artist
    - label: "Track"
      field: title
    - label: "Album"
      field: album
    - label: "Year"
      field: year
 
annotation_schemes:
  # Audio playback configuration
  - annotation_type: audio_annotation
    audio_display: waveform
    waveform_color: "#8B5CF6"
    progress_color: "#A78BFA"
    cursor_color: "#F59E0B"
    height: 100
    show_duration: true
    show_current_time: true
    speed_control: true
    speed_options: [0.75, 1.0, 1.25, 1.5]
    volume_control: true
    default_volume: 0.7
    autoplay: false
    loop: true
 
  # Genre classification
  - annotation_type: multiselect
    name: genres
    description: "Select all applicable genres"
    labels:
      - Rock
      - Pop
      - Electronic/Dance
      - Hip-Hop/Rap
      - R&B/Soul
      - Jazz
      - Classical
      - Country
      - Folk/Acoustic
      - Metal
      - Punk
      - Indie
      - World/International
      - Latin
      - Blues
      - Reggae
    min_selections: 1
    max_selections: 3
 
  # Mood/Energy
  - annotation_type: multiselect
    name: mood
    description: "Select the mood(s) of this track"
    labels:
      - Happy/Uplifting
      - Sad/Melancholic
      - Energetic
      - Calm/Relaxing
      - Aggressive
      - Romantic
      - Dark/Moody
      - Nostalgic
      - Motivational
    min_selections: 1
 
  # Energy level
  - annotation_type: likert
    name: energy
    description: "Energy level"
    size: 5
    min_label: "Very low energy"
    max_label: "Very high energy"
 
  # Danceability
  - annotation_type: likert
    name: danceability
    description: "How danceable is this track?"
    size: 5
    min_label: "Not danceable"
    max_label: "Very danceable"
 
  # Vocal content
  - annotation_type: radio
    name: vocals
    description: "Vocal content"
    labels:
      - Instrumental only
      - Mostly instrumental
      - Balanced
      - Mostly vocals
      - Vocals dominant
 
  # Era/Style
  - annotation_type: radio
    name: era
    description: "Musical era/style"
    labels:
      - Pre-1970s (Classic)
      - 1970s
      - 1980s
      - 1990s
      - 2000s
      - 2010s
      - 2020s/Current
 
  # Quality flags
  - annotation_type: multiselect
    name: flags
    description: "Special flags (if applicable)"
    labels:
      - Explicit content
      - Live recording
      - Remix/Cover
      - Holiday/Seasonal
      - Soundtrack
      - Spoken word sections
    required: false
 
  # Confidence
  - annotation_type: likert
    name: confidence
    description: "Confidence in your genre classification"
    size: 5
    min_label: "Uncertain"
    max_label: "Very confident"
 
annotation_guidelines:
  title: "Music Tagging Guidelines"
  content: |
    ## Genre Selection
    - Choose 1-3 genres that best describe the track
    - Select the most specific applicable genre
    - When unsure, lean toward broader categories
 
    ## Mood Assessment
    - Consider the overall emotional tone
    - A track can have multiple moods
    - Think about how the music makes you feel
 
    ## Energy & Danceability
    - Energy: tempo, intensity, power
    - Danceability: rhythm, beat clarity, groove
 
    ## Listening Tips
    - Listen to at least 30 seconds
    - Pay attention to drops, changes, and transitions
    - Consider instrumentation and production style
 

Formato de Salida

json
{
  "id": "track_001",
  "audio_url": "/audio/preview_001.mp3",
  "metadata": {
    "artist": "The Beatles",
    "title": "Here Comes the Sun",
    "album": "Abbey Road",
    "year": 1969
  },
  "annotations": {
    "genres": ["Rock", "Pop"],
    "mood": ["Happy/Uplifting", "Nostalgic"],
    "energy": 3,
    "danceability": 2,
    "vocals": "Balanced",
    "era": "Pre-1970s (Classic)",
    "flags": [],
    "confidence": 5
  }
}

Consejos para la Anotación Musical

  1. Auriculares de calidad: Esenciales para los matices de género
  2. Volumen consistente: Evita la fatiga por ajustes constantes
  3. Vista previa completa: Escucha lo suficiente para detectar cambios
  4. Conocimiento de géneros: Familiarízate con las definiciones de géneros
  5. Toma descansos: La fatiga auditiva afecta el juicio

Próximos Pasos


Documentación completa de audio en /docs/features/audio-annotation.