• DocumentCode
    3074030
  • Title

    A speaker independent word recognition system based on phoneme recognition for a large size (212 words) vocabulary

  • Author

    Makino, Shozo ; Kido, Ken´iti

  • Author_Institution
    Tohoku University, Sendai, Japan
  • Volume
    9
  • fYear
    1984
  • fDate
    30742
  • Firstpage
    29
  • Lastpage
    32
  • Abstract
    This paper describes the speaker-independent spoken word recognition system for a large size vocabulary. Speech is analyzed by the filter bank, from whose logarithmic spectrum the 11 features are extracted every 10 ms. Using the features the speech is first segmented and the primary phoneme recognition is carried out for every segment using the Bayes decision method. After correcting errors in segmentation and phoneme recognition, the secondary recognition of part of the consonants is carried out and the phonemic sequence is determined. The word dictionary item having maximum likelihood to the sequence is chosen as the recognition output. The 75.9% score for the phoneme recognition and the 92.4% score for the word recognition are obtained for the training samples in the 212 words uttered by 10 male and 10 female speakers. For the same words uttered by 30 male and 20 female speakers different from the above speakers, the 88.1% word recognition score is obtained.
  • Keywords
    Data mining; Dictionaries; Error correction; Feature extraction; Filter bank; Frequency; Pattern recognition; Speech analysis; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1984.1172568
  • Filename
    1172568