• DocumentCode
    2296958
  • Title

    Speech recognition using temporally connected kernels in mixture density hidden Markov models

  • Author

    Somervuo, Panu

  • Author_Institution
    Neural Networks Res. Centre, Helsinki Univ. of Technol., Espoo, Finland
  • Volume
    6
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    3434
  • Abstract
    A method is presented for speeding up the performance of the HMM based speech recognition system where the states are modeled by a large number of Gaussian kernels. The emission probabilities of the states are usually dominated by the nearest Gaussians to the input vector. The speedup is gained without deteriorating the recognition accuracy by concentrating on these kernels in the reduced K-best-kernel search. In this work, the time information of the input is encoded to the connections of the kernels. The search for the dominating kernels is then performed along the kernel connections which model the trajectories of the speech in the feature space. In the experiments, speaker-dependent speech recognizers were trained for ten speakers. The number of distance computations between feature vectors and kernel mean vectors was reduced 75% without increasing the average phoneme recognition error, which was 5.7% for the baseline system
  • Keywords
    Gaussian distribution; hidden Markov models; speech recognition; Gaussian kernels; HMM based speech recognition system; PDF; average phoneme recognition error; emission probabilities; experiments; feature space; feature vectors; kernel mean vectors; mixture density hidden Markov models; performance; recognition accuracy; reduced K-best-kernel search; speaker-dependent speech recognizers; speech trajectories; temporally connected kernels; time information; Artificial neural networks; Hidden Markov models; Intelligent networks; Kernel; Neural networks; Pattern recognition; Probability density function; Self organizing feature maps; Speech recognition; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.860139
  • Filename
    860139