DocumentCode :
311026
Title :
Hybrid HMM/ANN systems for training independent tasks: experiments on Phonebook and related improvements
Author :
Dupont, Stéphane ; Bourlard, Hervé ; Deroo, Olivier ; Fontaine, Vincent ; Boite, Jean-Marc
Author_Institution :
Faculte Polytechnique de Mons, Belgium
Volume :
3
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
1767
Abstract :
In this paper, we evaluate multi-Gaussian HMM systems and hybrid HMM/ANN systems in the framework of task independent training for small size (75 words) and medium size (600 words) vocabularies. To do this, we use the Phonebook database (Pitrelli et al., 1995) which is particularly well suited to this kind of experiment since (1) it is a very large telephone database and (2) the size and content of the test vocabulary is very flexible. For each system, different HMM topologies are compared to test the influence of state tying (with a number of parameters approximately kept constant) on the recognition performance. Two lexica (Phonebook and CMU) are also compared and it is shown that the CMU lexicon leads to significantly better performance. Finally, it is shown that with a quite simple system and a few adaptations to the basic HMM/ANN scheme, recognition performance of 98.5% and 94.7% can easily be achieved, respectively on a lexicon of 75 and 600 words (isolated words, telephone speech and lexicon words not present in the training data)
Keywords :
Gaussian processes; hidden Markov models; learning (artificial intelligence); neural nets; speech recognition; CMU lexicon; HMM topologies; Phonebook; hybrid HMM/ANN systems; independent tasks; isolated words; lexica; lexicon words; medium size vocabularies; multi-Gaussian HMM systems; recognition performance; small size vocabularies; state tying; task independent training; telephone database; telephone speech; test vocabulary; training; Artificial neural networks; Automatic speech recognition; Context modeling; Databases; Hidden Markov models; System testing; Telephony; Topology; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.598872
Filename :
598872
Link To Document :
بازگشت