Tutorials5 min read
Anotación de Clasificación de Géneros Musicales
Crea una tarea de anotación musical con visualización de forma de onda, vistas previas de 30 segundos y etiquetas jerárquicas de género.
Potato Team·
Anotación de Clasificación de Géneros Musicales
La clasificación de géneros musicales alimenta los sistemas de recomendación, la generación de listas de reproducción y el descubrimiento de música. Este tutorial muestra cómo construir interfaces de anotación para el etiquetado de géneros con vistas previas de audio y taxonomías jerárquicas.
Clasificación Básica de Géneros
yaml
annotation_task_name: "Music Genre Classification"
data_files:
- data/tracks.json
item_properties:
audio_path: audio_preview
metadata_fields: [artist, title, album]
# Show track metadata
display:
show_metadata: true
metadata_template: "{{artist}} - {{title}}"
annotation_schemes:
- annotation_type: audio_annotation
audio_display: waveform
waveform_color: "#EC4899"
progress_color: "#F472B6"
speed_control: true
show_duration: true
name: primary_genre
description: "Select the primary genre"
labels:
- Rock
- Pop
- Hip-Hop
- Electronic
- Jazz
- Classical
- R&B
- Country
- Metal
- FolkTaxonomía Jerárquica de Géneros
yaml
annotation_schemes:
# Top-level genre
- annotation_type: radio
name: genre
description: "Primary genre"
labels:
- Rock
- Electronic
- Hip-Hop
- Pop
- Jazz
- Classical
# Subgenre (conditional)
- annotation_type: radio
name: subgenre
description: "Subgenre"
conditional:
depends_on: genre
options:
Rock:
- Classic Rock
- Alternative
- Indie Rock
- Hard Rock
- Punk
- Grunge
Electronic:
- House
- Techno
- Trance
- Drum & Bass
- Ambient
- Dubstep
"Hip-Hop":
- East Coast
- West Coast
- Trap
- Conscious
- Lo-fi
Pop:
- Synth Pop
- Dance Pop
- Indie Pop
- K-Pop
- Teen Pop
Jazz:
- Bebop
- Smooth Jazz
- Fusion
- Big Band
- Cool Jazz
Classical:
- Baroque
- Romantic
- Contemporary
- Opera
- ChamberEtiquetado Multi-Etiqueta de Géneros
La música a menudo abarca múltiples géneros:
yaml
annotation_schemes:
- annotation_type: multiselect
name: genres
description: "Select ALL applicable genres (up to 3)"
labels:
- Rock
- Pop
- Electronic
- Hip-Hop
- R&B
- Jazz
- Classical
- Country
- Folk
- Metal
- Punk
- Indie
- World
- Latin
min_selections: 1
max_selections: 3
- annotation_type: radio
name: primary_genre
description: "Which is the PRIMARY genre?"
dynamic_from: genres # Only show selected genres as optionsConfiguración Completa de Anotación Musical
yaml
annotation_task_name: "Music Tagging and Classification"
data_files:
- data/music_library.json
item_properties:
audio_path: preview_url
metadata_fields:
- artist
- title
- album
- year
- duration
display:
show_metadata: true
metadata_layout: card
fields:
- label: "Artist"
field: artist
- label: "Track"
field: title
- label: "Album"
field: album
- label: "Year"
field: year
annotation_schemes:
# Audio playback configuration
- annotation_type: audio_annotation
audio_display: waveform
waveform_color: "#8B5CF6"
progress_color: "#A78BFA"
cursor_color: "#F59E0B"
height: 100
show_duration: true
show_current_time: true
speed_control: true
speed_options: [0.75, 1.0, 1.25, 1.5]
volume_control: true
default_volume: 0.7
autoplay: false
loop: true
# Genre classification
- annotation_type: multiselect
name: genres
description: "Select all applicable genres"
labels:
- Rock
- Pop
- Electronic/Dance
- Hip-Hop/Rap
- R&B/Soul
- Jazz
- Classical
- Country
- Folk/Acoustic
- Metal
- Punk
- Indie
- World/International
- Latin
- Blues
- Reggae
min_selections: 1
max_selections: 3
# Mood/Energy
- annotation_type: multiselect
name: mood
description: "Select the mood(s) of this track"
labels:
- Happy/Uplifting
- Sad/Melancholic
- Energetic
- Calm/Relaxing
- Aggressive
- Romantic
- Dark/Moody
- Nostalgic
- Motivational
min_selections: 1
# Energy level
- annotation_type: likert
name: energy
description: "Energy level"
size: 5
min_label: "Very low energy"
max_label: "Very high energy"
# Danceability
- annotation_type: likert
name: danceability
description: "How danceable is this track?"
size: 5
min_label: "Not danceable"
max_label: "Very danceable"
# Vocal content
- annotation_type: radio
name: vocals
description: "Vocal content"
labels:
- Instrumental only
- Mostly instrumental
- Balanced
- Mostly vocals
- Vocals dominant
# Era/Style
- annotation_type: radio
name: era
description: "Musical era/style"
labels:
- Pre-1970s (Classic)
- 1970s
- 1980s
- 1990s
- 2000s
- 2010s
- 2020s/Current
# Quality flags
- annotation_type: multiselect
name: flags
description: "Special flags (if applicable)"
labels:
- Explicit content
- Live recording
- Remix/Cover
- Holiday/Seasonal
- Soundtrack
- Spoken word sections
required: false
# Confidence
- annotation_type: likert
name: confidence
description: "Confidence in your genre classification"
size: 5
min_label: "Uncertain"
max_label: "Very confident"
annotation_guidelines:
title: "Music Tagging Guidelines"
content: |
## Genre Selection
- Choose 1-3 genres that best describe the track
- Select the most specific applicable genre
- When unsure, lean toward broader categories
## Mood Assessment
- Consider the overall emotional tone
- A track can have multiple moods
- Think about how the music makes you feel
## Energy & Danceability
- Energy: tempo, intensity, power
- Danceability: rhythm, beat clarity, groove
## Listening Tips
- Listen to at least 30 seconds
- Pay attention to drops, changes, and transitions
- Consider instrumentation and production style
Formato de Salida
json
{
"id": "track_001",
"audio_url": "/audio/preview_001.mp3",
"metadata": {
"artist": "The Beatles",
"title": "Here Comes the Sun",
"album": "Abbey Road",
"year": 1969
},
"annotations": {
"genres": ["Rock", "Pop"],
"mood": ["Happy/Uplifting", "Nostalgic"],
"energy": 3,
"danceability": 2,
"vocals": "Balanced",
"era": "Pre-1970s (Classic)",
"flags": [],
"confidence": 5
}
}Consejos para la Anotación Musical
- Auriculares de calidad: Esenciales para los matices de género
- Volumen consistente: Evita la fatiga por ajustes constantes
- Vista previa completa: Escucha lo suficiente para detectar cambios
- Conocimiento de géneros: Familiarízate con las definiciones de géneros
- Toma descansos: La fatiga auditiva afecta el juicio
Próximos Pasos
- Añade detección de eventos de audio para identificación de instrumentos
- Configura crowdsourcing para perspectivas diversas
- Aprende sobre métricas de acuerdo para música
Documentación completa de audio en /docs/features/audio-annotation.