• DocumentCode
    311026
  • Title

    Hybrid HMM/ANN systems for training independent tasks: experiments on Phonebook and related improvements

  • Author

    Dupont, Stéphane ; Bourlard, Hervé ; Deroo, Olivier ; Fontaine, Vincent ; Boite, Jean-Marc

  • Author_Institution
    Faculte Polytechnique de Mons, Belgium
  • Volume
    3
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1767
  • Abstract
    In this paper, we evaluate multi-Gaussian HMM systems and hybrid HMM/ANN systems in the framework of task independent training for small size (75 words) and medium size (600 words) vocabularies. To do this, we use the Phonebook database (Pitrelli et al., 1995) which is particularly well suited to this kind of experiment since (1) it is a very large telephone database and (2) the size and content of the test vocabulary is very flexible. For each system, different HMM topologies are compared to test the influence of state tying (with a number of parameters approximately kept constant) on the recognition performance. Two lexica (Phonebook and CMU) are also compared and it is shown that the CMU lexicon leads to significantly better performance. Finally, it is shown that with a quite simple system and a few adaptations to the basic HMM/ANN scheme, recognition performance of 98.5% and 94.7% can easily be achieved, respectively on a lexicon of 75 and 600 words (isolated words, telephone speech and lexicon words not present in the training data)
  • Keywords
    Gaussian processes; hidden Markov models; learning (artificial intelligence); neural nets; speech recognition; CMU lexicon; HMM topologies; Phonebook; hybrid HMM/ANN systems; independent tasks; isolated words; lexica; lexicon words; medium size vocabularies; multi-Gaussian HMM systems; recognition performance; small size vocabularies; state tying; task independent training; telephone database; telephone speech; test vocabulary; training; Artificial neural networks; Automatic speech recognition; Context modeling; Databases; Hidden Markov models; System testing; Telephony; Topology; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.598872
  • Filename
    598872