• DocumentCode
    2800136
  • Title

    Leveraging speaker diarization for meeting recognition from distant microphones

  • Author

    Stolcke, Andreas ; Friedland, Gerald ; Imseng, David

  • Author_Institution
    SRI Int., Menlo Park, CA, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4390
  • Lastpage
    4393
  • Abstract
    We investigate using state-of-the-art speaker diarization output for speech recognition purposes. While it seems obvious that speech recognition could benefit from the output of speaker diarization (“Who spoke when”) for effective feature normalization and model adaptation, such benefits have remained elusive in the very challenging domain of meeting recognition from distant microphones. In this study, we show that recognition gains are possible by careful post-processing of the diarization output. Still, recognition accuracy may suffer when the underlying diarization system performs worse than expected, even compared to far less sophisticated speaker-clustering techniques. We obtain a more accurate and robust overall system by combining recognition output with multiple speaker segmentations and clusterings. We evaluate our methods on data from the 2009 NIST Rich Transcription meeting recognition evaluation.
  • Keywords
    microphones; pattern clustering; speaker recognition; speech processing; NIST Rich Transcription meeting recognition evaluation; feature normalization; microphone; model adaptation; multiple speaker clustering; multiple speaker segmentation; speaker diarization; speech recognition; Adaptation model; Computer science; Error analysis; Loudspeakers; Microphones; NIST; Performance evaluation; Robustness; Speech processing; Speech recognition; meeting recognition; rich transcription; speaker diarization; speech processing; system combination;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495626
  • Filename
    5495626