Title :
Employing heterogeneous information in a multi-stream framework
Author :
Christensen, Heidi ; Lindberg, Borge ; Andersen, Ove
Author_Institution :
Center for PersonKommunikation, Aalborg Univ., Denmark
Abstract :
A multi-stream speech recogniser is based on the combination of multiple feature streams each containing complementary information. In the past, multi-stream research has typically focused on systems that use a single feature extraction method. This heritage from conventional speech recognisers is an unnecessary restriction and both psychoacoustic and phonetic knowledge strongly motivate the use of heterogeneous features. In this paper we investigate how heterogeneous processing can be used in two different multi-stream configurations: first, a system where each stream handles a different frequency region of the speech (a multi-band recogniser) and, second a multi-stream recogniser where each stream handles the full frequency region. For each type of system we compare the performance using both homogeneous and heterogeneous processing. We demonstrate that the use of heterogeneous information significantly improves the clean speech recognition performance motivating us to continue exploring more specifically designed stream processing
Keywords :
feature extraction; speech recognition; clean speech recognition; heterogeneous features; heterogeneous information; multi-band recogniser; multi-stream framework; multi-stream speech recogniser; multiple feature stream; performance; Automatic speech recognition; Decoding; Feature extraction; Frequency; Hidden Markov models; Process design; Psychology; Signal processing; Speech processing; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.861977