Regional Information Center for Science and Technology (RICeST) - Multistream robust speaker recognition based on speech intelligibility

DocumentCode :

3118766

Title :

Multistream robust speaker recognition based on speech intelligibility

Author :

Nemala, Sridhar Krishna ; Elhilali, Mounya

Author_Institution :

Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA

fYear :

2011

fDate :

23-25 March 2011

Firstpage :

Lastpage :

Abstract :

Delimiting the most informative voice segments of an acoustic signal is often a crucial initial step for any speech processing system. In the current work, we propose a novel segmentation approach based on a perception-based measure of speech intelligibility. Unlike segmentation approaches based on various forms of voice-activity detection (VAD), the proposed segmentation approach exploits higher-level perceptual information about the signal intelligibility levels. This classification based on intelligibility estimates is integrated into a novel multistream framework for the automatic speaker recognition task. The multistream system processes the input acoustic signal along multiple independent streams reflecting various levels of intelligibility and then fuses the decision scores from the multiple streams according to their intelligibility contribution. Our results show that the proposed multistream system achieves significant improvements in both clean and noisy conditions when compared with a baseline and a state-of-the-art voice-activity detection algorithm.
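
The fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not state the banding or fusion rule, so everything here is an assumption for illustration: segments are assigned to streams by thresholding an intelligibility estimate, and the per-stream decision scores are combined as a weighted average. The function names `assign_stream` and `fuse_stream_scores`, the thresholds, and the weights are hypothetical and are not taken from the paper.

```python
import bisect
from typing import Sequence


def assign_stream(intelligibility: float, thresholds: Sequence[float]) -> int:
    """Return the index of the intelligibility band a segment falls into.

    `thresholds` must be sorted in ascending order; N thresholds define
    N + 1 bands (streams). Assumed banding rule, not the paper's.
    """
    return bisect.bisect_right(thresholds, intelligibility)


def fuse_stream_scores(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Combine per-stream decision scores as a weighted average.

    `weights` stands in for each stream's intelligibility contribution
    (an assumed fusion rule; the abstract does not give the formula).
    """
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(s * w for s, w in zip(scores, weights)) / total


if __name__ == "__main__":
    # Hypothetical thresholds splitting segments into low/medium/high streams.
    thresholds = [0.3, 0.7]
    print(assign_stream(0.5, thresholds))
    # Hypothetical per-stream scores, weighted toward the more intelligible stream.
    print(fuse_stream_scores([1.0, 0.0], [3.0, 1.0]))
```

A weighted average is only one plausible way to fuse scores "according to their intelligibility contribution"; the paper itself should be consulted for the actual rule.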

Keywords :

speaker recognition; speech intelligibility; acoustic signal; automatic speaker recognition task; informative voice segments; multistream robust speaker recognition; voice-activity detection; Computational modeling; Noise; Robustness;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Information Sciences and Systems (CISS), 2011 45th Annual Conference on

Conference_Location :

Baltimore, MD

Print_ISBN :

978-1-4244-9846-8

Electronic_ISBN :

978-1-4244-9847-5

Type :

conf

DOI :

10.1109/CISS.2011.5766105

Filename :

5766105

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3118766