• DocumentCode
    907319
  • Title

    Phoneme recognition using time-delay neural networks

  • Author

    Waibel, Alexander ; Hanazawa, Toshiyuki ; Hinton, Geoffrey ; Shikano, Kiyohiro ; Lang, Kevin J.

  • Author_Institution
    Dept. of Comput. Sci., Carnegie-Mellon Univ., Pittsburgh, PA, USA
  • Volume
    37
  • Issue
    3
  • fYear
    1989
  • fDate
    3/1/1989 12:00:00 AM
  • Firstpage
    328
  • Lastpage
    339
  • Abstract
    The authors present a time-delay neural network (TDNN) approach to phoneme recognition which is characterized by two important properties: (1) using a three-layer arrangement of simple computing units, a hierarchy can be constructed that allows for the formation of arbitrary nonlinear decision surfaces, which the TDNN learns automatically using error backpropagation; and (2) the time-delay arrangement enables the network to discover acoustic-phonetic features and the temporal relationships between them independently of position in time and therefore not blurred by temporal shifts in the input. As a recognition task, the speaker-dependent recognition of the phonemes B, D, and G in varying phonetic contexts was chosen. For comparison, several discrete hidden Markov models (HMM) were trained to perform the same task. Performance evaluation over 1946 testing tokens from three speakers showed that the TDNN achieves a recognition rate of 98.5% correct while the rate obtained by the best of the HMMs was only 93.7%
  • Keywords
    neural nets; speech recognition; computing units; error backpropagation; hidden Markov models; nonlinear decision surfaces; phoneme recognition; speech; temporal shifts; testing tokens; three-layer; time-delay neural networks; Acoustic testing; Backpropagation; Character recognition; Computer networks; Computer science; Hidden Markov models; Loudspeakers; Neural networks; Psychology; Speech recognition;
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0096-3518
  • Type

    jour

  • DOI
    10.1109/29.21701
  • Filename
    21701