DocumentCode
2310420
Title
Robust Speaker Recognition Using Binary Time-Frequency Masks
Author
Shao, Yang ; Wang, DeLiang
Author_Institution
Dept. of Comput. Sci. & Eng., Ohio State Univ.
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
Conventional speaker recognition systems perform poorly under noisy conditions. In this paper, we evaluate binary time-frequency masks for robust speaker recognition. An ideal binary mask is a priori defined as a binary matrix where 1 indicates that the target is stronger than the interference within the corresponding time-frequency unit and 0 indicates otherwise. We perform speaker identification and verification using a missing data recognizer under cochannel and other noise conditions, and show that the ideal binary mask provides large performance gains. By employing a speech segregation system that estimates the ideal binary mask, we achieve significant improvements over alternative approaches. Our study, thus, demonstrates that the use of binary masking represents a promising direction for robust speaker recognition
Keywords
matrix algebra; speaker recognition; time-frequency analysis; binary matrix; binary time-frequency masks; data recognizer; robust speaker recognition; speech segregation system; Acoustic noise; Interference; Loudspeakers; Noise robustness; Performance gain; Speaker recognition; Speech coding; Speech enhancement; Speech recognition; Time frequency analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660103
Filename
1660103
Link To Document