DocumentCode
2155459
Title
Robust speech recognition in a high interference real room environment using blind speech extraction
Author
Koutras, A. ; Dermatas, E.
Author_Institution
Electr. & Comput. Eng. Dept., Patras Univ., Greece
Volume
1
fYear
2002
fDate
2002
Firstpage
167
Abstract
We present a novel blind signal extraction (BSE) method for robust speech recognition in a real room environment under the coexistence of simultaneous interfering non-speech sources. The proposed method is capable of extracting the target speaker´s voice based on a maximum kurtosis criterion. Extensive phoneme recognition experiments have proved the proposed network´s efficacy when used in a real-life situation of a talking speaker with the coexistence of various non-speech sources (e.g. music and noise), achieving a phoneme recognition improvement of about 23%, especially under high interference. Furthermore, comparison of the proposed network to known blind source separation networks, commonly used in similar situations, showed lower computational complexity and better recognition accuracy of the BSE network, making it ideal to be used as a front-end to existing ASR systems.
Keywords
acoustic noise; blind source separation; computational complexity; speech recognition; statistical analysis; ASR; acoustic interference; automatic speech recognition; blind signal extraction; blind source separation; blind speech extraction; cocktail party effect; computational complexity; high interference real room environment; maximum kurtosis criterion; statistical analysis; Automatic speech recognition; Blind source separation; Data mining; Delay; Interference; Loudspeakers; Microphones; Robustness; Source separation; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Signal Processing, 2002. DSP 2002. 2002 14th International Conference on
Print_ISBN
0-7803-7503-3
Type
conf
DOI
10.1109/ICDSP.2002.1027867
Filename
1027867
Link To Document