Speaker Diarization
Segment and label speakers in multi-party conversations following AMI Meeting Corpus guidelines.
获取此设计
This design is available in our showcase. Copy the configuration below to get started.
快速开始:
# Create your project folder mkdir speaker-diarization cd speaker-diarization # Copy config.yaml from above potato start config.yaml
详情
标注类型
领域
应用场景
标签
相关设计
DISPLACE 2024 - Speaker and Language Diarization
Speaker and language diarization in multilingual conversational audio. Annotators mark speaker turn boundaries, identify speakers, and label the language of each segment in conversational environments (Kundu et al., INTERSPEECH 2024).
ToBI Prosodic Annotation
Multi-tier prosodic annotation following the Tones and Break Indices (ToBI) framework. Annotators label pitch accents, phrase accents, boundary tones, and break indices on speech utterances, producing a layered prosodic transcription aligned to the audio timeline (Silverman et al., Speech Communication 1992).
Adverse Drug Event Extraction (CADEC)
Named entity recognition for adverse drug events from patient-reported experiences, based on the CADEC corpus (Karimi et al., 2015). Annotates drugs, adverse effects, symptoms, diseases, and findings from colloquial health forum posts with mapping to medical vocabularies (SNOMED-CT, MedDRA).