• DocumentCode
    284617
  • Title

    Incorporating acoustic-phonetic knowledge in hybrid TDNN/HMM frameworks

  • Author

    Dugast, Christian ; Devillers, Laurence

  • Author_Institution
    Philips Res. Lab. Aachen, Germany
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    421
  • Abstract
    A comparison of several architectures of time-delayed neural networks (TDNN) as the preprocessing step for hidden Markov modeling (HMM) speaker-dependent continuous-speech recognition systems is presented. A modular TDNN architecture on the basis of acoustic-phonetic knowledge, where each sub-network is trained on a different subset of phonemes is defined. It allows the authors to define a hierarchical tree structure of sub-networks. This structure offers the possibility of proposing a framework to enlarge the number of outputs by defining context-dependent sub-networks. They also compare different methods for integrating TDNN in a HMM framework, a discrete and a continuous integration. For the speaker JWS-4 of the speaker-dependent DARPA RM1 database, with context independent phonemes, 21.3% word error rate are obtained without grammar, 4.6% with the DARPA word-pair grammar (perplexity of 60)
  • Keywords
    context-sensitive grammars; hidden Markov models; neural nets; speech recognition; DARPA word-pair grammar; acoustic-phonetic knowledge; context independent phonemes; context-dependent sub-networks; continuous integration; discrete integration; hidden Markov modeling; hierarchical tree structure; hybrid TDNN/HMM frameworks; modular TDNN architecture; number of outputs; perplexity; phoneme subsets; speaker JWS-4; speaker-dependent DARPA RM1 database; speaker-dependent continuous-speech recognition systems; time-delayed neural networks; Databases; Hidden Markov models; Intelligent networks; Laboratories; Neural networks; Power system modeling; Speech recognition; Topology; Tree data structures; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225882
  • Filename
    225882