DocumentCode :
3118766
Title :
Multistream robust speaker recognition based on speech intelligibility
Author :
Nemala, Sridhar Krishna ; Elhilali, Mounya
Author_Institution :
Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
fYear :
2011
fDate :
23-25 March 2011
Firstpage :
1
Lastpage :
5
Abstract :
Delimiting the most informative voice segments of an acoustic signal is often a crucial initial step for any speech processing system. In the current work, we propose a novel segmentation approach based on a perception-based measure of speech intelligibility. Unlike segmentation approaches based on various forms of voice-activity detection (VAD), the proposed segmentation approach exploits higher-level perceptual information about the signal intelligibility levels. This classification based on intelligibility estimates is integrated into a novel multistream framework for the automatic speaker recognition task. The multistream system processes the input acoustic signal along multiple independent streams reflecting various levels of intelligibility and then fuses the decision scores from the multiple streams according to their intelligibility contribution. Our results show that the proposed multistream system achieves significant improvements both in clean and noisy conditions when compared with a baseline and a state-of-the-art voice-activity detection algorithm.
Keywords :
speaker recognition; speech intelligibility; acoustic signal; automatic speaker recognition task; informative voice segments; multistream robust speaker recognition; voice-activity detection; Computational modeling; Noise; Robustness
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Information Sciences and Systems (CISS), 2011 45th Annual Conference on
Conference_Location :
Baltimore, MD
Print_ISBN :
978-1-4244-9846-8
Electronic_ISBN :
978-1-4244-9847-5
Type :
conf
DOI :
10.1109/CISS.2011.5766105
Filename :
5766105