DocumentCode :
2310420
Title :
Robust Speaker Recognition Using Binary Time-Frequency Masks
Author :
Shao, Yang ; Wang, DeLiang
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ.
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
Conventional speaker recognition systems perform poorly under noisy conditions. In this paper, we evaluate binary time-frequency masks for robust speaker recognition. An ideal binary mask is a priori defined as a binary matrix where 1 indicates that the target is stronger than the interference within the corresponding time-frequency unit and 0 indicates otherwise. We perform speaker identification and verification using a missing data recognizer under cochannel and other noise conditions, and show that the ideal binary mask provides large performance gains. By employing a speech segregation system that estimates the ideal binary mask, we achieve significant improvements over alternative approaches. Our study, thus, demonstrates that the use of binary masking represents a promising direction for robust speaker recognition
Keywords :
matrix algebra; speaker recognition; time-frequency analysis; binary matrix; binary time-frequency masks; data recognizer; robust speaker recognition; speech segregation system; Acoustic noise; Interference; Loudspeakers; Noise robustness; Performance gain; Speaker recognition; Speech coding; Speech enhancement; Speech recognition; Time frequency analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660103
Filename :
1660103
Link To Document :
بازگشت