• DocumentCode
    2996041
  • Title

    Isolated word recognition using hidden Markov models

  • Author

    Sugawara, Kazuhide ; Nishimura, Masafumi ; Toshioka, Koichi ; Okochi, Masaaki ; Kaneko, Tovohisa

  • Author_Institution
    Science Institute, IBM Japan Ltd., Japan
  • Volume
    10
  • fYear
    1985
  • fDate
    31138
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper we investigated smoothing techniques for alleviating the problem due to insufficient amount of training data. Hidden Markov Models (HMM) require a large amount of training data to obtain reliable probability estimates. But for isolated word recognition (100 words or more), we can not expect a user to speak each word more than several times. We found that the confusion matrix between a pair of label prototypes was particularly effective for the problem. We investigated two ways of computing the confusion matrix. One is based on distance among labels, and the other is based on the correspondence of labels in several utterances of the same word. Performance of these techniques was tested by using 100 Japanese city names spoken in an isolated word mode by three speakers. It was found that the smoothing technique reduced recognition errors from 1% to 0.1%. To visualize such performance improvement, we used, together with recognition rate, "two-dimensional score plot," which shows the distribution of the best score of the true word and that of the remaining false ones in the vocabulary.
  • Keywords
    Band pass filters; Cities and towns; Decoding; Discrete Fourier transforms; Hidden Markov models; Prototypes; Smoothing methods; Testing; Training data; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '85.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1985.1168452
  • Filename
    1168452