• DocumentCode
    3162314
  • Title

    Speaker diarization of meetings based on large TDOA feature vectors

  • Author

    Vijayasenan, Deepu ; Valente, Fabio

  • Author_Institution
    Univ. des Saarlandes, Saarbrucken, Germany
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4173
  • Lastpage
    4176
  • Abstract
    This paper investigates the use of large TDOA feature vectors together with acoustic information in speaker diarization of meetings. TDOAs are obtained by considering all possible microphones pairs and this approach is compared with conventional TDOA features extracted w.r.t. a reference channel. The study is carried using two systems, the first based on Gaussian Mixture Modeling and the second based on the Information Bottleneck approach. Results on NIST RT06/RT07/RT09 evaluation datasets show a large speaker error reduction of 30% relative going from 14.3% to 10.8% for the first and from 12.3% to 8.2% for the second whenever the feature weighting is properly handled. Furthermore results reveal that the IB system is more robust to different number of microphones even when all pairs large TDOA vectors are used thus outperforming the HMM/GMM by 25% relative (8.2% error compared to 10.8%).
  • Keywords
    Gaussian processes; speaker recognition; time-of-arrival estimation; Gaussian mixture modeling; NIST RT06/RT07/RT09 evaluation dataset; acoustic information; information bottleneck; large TDOA feature vectors; meeting diarization; microphones pair; reference channel; speaker diarization; Acoustics; Delay; Hidden Markov models; Microphones; NIST; Speech; Vectors; Meetings Recordings; Model combination; Speaker diarization; Time Delay Of Arrival features;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288838
  • Filename
    6288838