Improving speaker diarization for naturalistic child-adult conversational interactions using contextual information. Academic Article uri icon

Overview

abstract

  • While deep learning has driven recent improvements in audio speaker diarization, it often faces performance issues in challenging interaction scenarios and varied acoustic settings such as between a child and adult (caregiver/examiner). In this work, the role of contextual factors that affect diarization performance in such interactions is analyzed. Factors that affect each type of diarization error are identified. Furthermore, a DNN is trained on diarization outputs in conjunction with the factors to improve diarization performance. The results demonstrate the usefulness of incorporating context in improving diarization performance of child-adult interactions in clinical settings.

publication date

  • February 1, 2020

Research

keywords

  • Acoustics
  • Communication

Identity

PubMed Central ID

  • PMC7030978

Scopus Document Identifier

  • 85080086450

Digital Object Identifier (DOI)

  • 10.1121/10.0000736

PubMed ID

  • 32113327

Additional Document Info

volume

  • 147

issue

  • 2