• DocumentCode
    323583
  • Title

    ACID/HNN: clustering hierarchies of neural networks for context-dependent connectionist acoustic modeling

  • Author

    Fritsch, Jürgen ; Finke, Michael

  • Author_Institution
    Interactive Syst. Labs., Karlsruhe Univ., Germany
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    505
  • Abstract
    We present the ACID/HNN framework, a principled approach to hierarchical connectionist acoustic modeling in large vocabulary conversational speech recognition (LVCSR). Our approach consists of an agglomerative clustering algorithm based on information divergence (ACID) to automatically design and robustly estimate hierarchies of neural networks (HNN) for arbitrarily large sets of context-dependent decision tree clustered HMM states. We argue that a hierarchical approach is crucial in applying locally discriminative connectionist models to the typically very large state spaces observed in LVCSR systems. We evaluate the ACID/HNN framework on the Switchboard conversational telephone speech corpus. Furthermore, we focus on the benefits of the proposed connectionist acoustic model, namely exploiting the hierarchical structure for speaker adaptation and decoding speed-up algorithms.
  • Keywords
    decoding; estimation theory; neural nets; speech recognition; ACID; ACID/HNN; HNN; LVCSR; Switchboard conversational telephone speech corpus; agglomerative clustering algorithm based on information divergence; clustering hierarchies; context-dependent connectionist acoustic modeling; context-dependent decision tree clustered HMM states; hierarchical connectionist acoustic modeling; hierarchical structure; large vocabulary conversational speech recognition; locally discriminative connectionist models; neural networks; speaker adaptation; speed-up algorithms decoding; Algorithm design and analysis; Clustering algorithms; Decision trees; Hidden Markov models; Neural networks; Robustness; Speech recognition; State estimation; State-space methods; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.674478
  • Filename
    674478