• DocumentCode
    3250791
  • Title

    Speech recognition using dynamic neural networks

  • Author

    Botros, Nazeih M. ; Premnath, S.

  • Author_Institution
    Dept. of Electr. Eng., Southern Illinois Univ., Carbondale, IL, USA
  • Volume
    4
  • fYear
    1992
  • fDate
    7-11 Jun 1992
  • Firstpage
    737
  • Abstract
    The authors present an algorithm for isolated-word recognition that takes into consideration the duration variability of the different utterances of the same word. The algorithm is based on extracting acoustical features from the speech signal and using them as the input to a sequence of multilayer perceptron neural networks. The networks were implemented as predictors for the speech samples for a certain duration of time. The networks were trained by a combination of the back-propagation and the dynamic time warping (DTW) techniques. The DTW technique was implemented to normalize the duration variability. The networks were trained to recognize the correct words and to reject the wrong words. The training set consisted of ten words, each uttered seven times by three different speakers. The test set consisted of three utterances of each of the ten words. The results show that all these words could be recognized
  • Keywords
    dynamic programming; feedforward neural nets; learning (artificial intelligence); speech recognition; acoustical features; back-propagation; duration variability; dynamic neural networks; dynamic time warping; isolated-word recognition; multilayer perceptron neural networks; predictors; speech recognition; utterances; Automatic speech recognition; Cepstral analysis; Feature extraction; Linear predictive coding; Multilayer perceptrons; Neural networks; Pattern recognition; Signal processing algorithms; Speech recognition; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1992. IJCNN., International Joint Conference on
  • Conference_Location
    Baltimore, MD
  • Print_ISBN
    0-7803-0559-0
  • Type

    conf

  • DOI
    10.1109/IJCNN.1992.227230
  • Filename
    227230