• DocumentCode
    2451897
  • Title

    Dual-microphone based binary mask estimation for robust speaker verification

  • Author

    Zhao, Yali ; Fu, Zhong-Hua ; Xie, Lei ; Zhang, Jian ; Zhang, Yanning

  • Author_Institution
    Shaanxi Provincial Key Lab. of Speech & Image Inf. Process., Northwestern Polytech. Univ., Xian, China
  • fYear
    2012
  • fDate
    16-18 July 2012
  • Firstpage
    1014
  • Lastpage
    1019
  • Abstract
    Missing feature theory (MFT) has shown great potential for robust speaker recognition in noisy environments. Accurate estimation of binary mask is crucial in MFT-based speaker recognition. This paper addresses the speaker verification problem using MFT in a practical scenario: the location of target speaker is fixed while the locations of noise interferences are unknown. Specifically, we propose a dual-microphone semi-blind approach to estimate the binary mask. During system initialization, a spatial location model for the target is trained precisely. Then a spatial model for corrupted speech is obtained on-line by model adaptation. Finally, the binary mask is estimated by likelihood comparison. Moreover, we propose a reliable frame selection method to further focus on the reliable speech frames for missing data speaker recognition. Experimental results demonstrate that our proposed approach achieves substantial improvements in recognition performance in both white noise and speech corrupted conditions.
  • Keywords
    speaker recognition; MFT-based speaker recognition; dual-microphone based binary mask estimation; dual-microphone semi-blind approach; missing feature theory; noisy environments; reliable frame selection method; robust speaker verification; speaker verification problem; system initialization; Estimation; Noise; Robustness; Speaker recognition; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2012 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0173-2
  • Type

    conf

  • DOI
    10.1109/ICALIP.2012.6376764
  • Filename
    6376764