• DocumentCode
    3528154
  • Title

    Speaker diarization in meeting audio

  • Author

    Nwe, Tin Lay ; Sun, Hanwu ; Li, Haizhou ; Rahardja, Susanto

  • Author_Institution
    Inst. for Infocomm Res. (I2R), A*STAR, Singapore
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4073
  • Lastpage
    4076
  • Abstract
    This paper describes a speaker diarization system evaluated on the NIST Rich Transcription 2007 (RT-07) meeting recognition evaluation data set for the multiple distant microphone (MDM) task. Our implementation includes three components: initial clustering, non-speech removal, and cluster purification. Initial clusters are generated using direction of arrival (DOA) information and bootstrap clustering. The non-speech removal component uses multiple Gaussian mixture models (GMMs) for speech/non-speech classification. In addition, a novel system fusion strategy that uses information from the receiver operating characteristic (ROC) curve is proposed for this component. Finally, a consensus clustering approach, together with an iterative GMM clustering method, is employed for speaker cluster purification. The system achieves an overall diarization error rate (DER) of 10.81%.
  • Keywords
    direction-of-arrival estimation; pattern classification; pattern clustering; speaker recognition; GMM modeling; NIST Rich Transcription 2007 meeting recognition evaluation data set; bootstrap clustering; consensus clustering approach; directional of arrival; meeting audio; multiple distant microphone; nonspeech classification; nonspeech removal component; receiver operating curve; speaker cluster purification; speaker diarization system; system fusion strategy; Adaptive filters; Conferences; Direction of arrival estimation; Erbium; Machine learning; Natural languages; Purification; Speech processing; Sun; Tin; Meetings; clustering methods; modeling; pattern classification; speech processing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960523
  • Filename
    4960523