• DocumentCode
    164839
  • Title

    Ensemble integration of calibrated speaker localization and statistical speech detection in domestic environments

  • Author

    Tachioka, Yuuki ; Narita, T. ; Watanabe, Shigetaka ; Le Roux, Jonathan

  • Author_Institution
    Inf. Technol. R&D Center, Mitsubishi Electr. Corp., Kamakura, Japan
  • fYear
    2014
  • fDate
    12-14 May 2014
  • Firstpage
    162
  • Lastpage
    166
  • Abstract
    This paper describes speaker localization and speech detection techniques for domestic environments. In real environments, it is hard to localize speakers because reverberation causes discrepancy from the simple spherical wave assumption. We propose a template-based method that calibrates the localization errors included in conventional methods. In addition, we use statistical speech detection methods to deal with noises. However, in this challenge, there are five rooms and leaked utterances from other rooms must be rejected. This kind of rejection is hard to perform by only using speech detection results. To address this problem, we also propose a method that integrates speech localization and speech detection using a minimum cost criterion or a classifier-based strategy. The proposed method achieved an accuracy of 0.712 for speaker localization and an F value of 0.743 for speech detection on the development set compared with the baseline 0.559 and 0.570, and 0.666 and 0.706 on the test set compared with the baseline 0.517 and 0.602.
  • Keywords
    calibration; pattern classification; speaker recognition; statistical analysis; F value; calibrated speaker localization; classifier-based strategy; domestic environments; leaked utterances; localization error calibration; minimum cost criterion; spherical wave assumption; statistical speech detection method; template-based method; Artificial neural networks; Iron; Microphone arrays; Noise; Speech; Support vector machines; Speaker localization; calibration; rejection; speech detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on
  • Conference_Location
    Villers-les-Nancy
  • Type

    conf

  • DOI
    10.1109/HSCMA.2014.6843272
  • Filename
    6843272