• DocumentCode
    3423831
  • Title
    New implementations of the E-HMM-based system for speaker diarization in meeting rooms

  • Author
    Fredouille, Corinne ; Evans, Nicholas

  • Author_Institution
    LIA, Univ. of Avignon, Avignon
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4357
  • Lastpage
    4360
  • Abstract
    This paper addresses the problem of speaker diarization in the specific context of meeting room recordings. Some new enhancements to the E-HMM-based speaker diarization system are reported. These involve a different approach to speaker modelling utilising EM/ML-based training rather than MAP adaptation as in our previous work. Using the new system we investigate the effects of speech activity detection through speaker diarization experiments conducted on 23 meetings extracted from the NIST/RT evaluation campaign datasets. We propose a new approach, which assigns confidence values according to the type of information carried by the signal and incorporates these values directly into the speaker diarization system. Experimental results show that, perhaps surprisingly, the non-speech segments do not systematically affect the robustness of the speaker diarization system, and more precisely the speaker model training process.
  • Keywords
    hidden Markov models; speaker recognition; speech processing; HMM-based speaker diarization system; meeting room recordings; nonspeech segments; speaker diarization system; speaker modelling; speech activity detection; Clustering algorithms; Data mining; Microphones; NIST; Protocols; Robustness; Shape; Speech analysis; Speech enhancement; confidence values; meeting rooms; speaker diarization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2008.4518620
  • Filename
    4518620