• DocumentCode
    2389930
  • Title

    Feature extraction from speech spectrograms using multi-layered network models

  • Author

    Palakal, Mathew J. ; Zoran, Michael J.

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Indiana Univ., Indianapolis, IN, USA
  • fYear
    1989
  • fDate
    23-25 Oct 1989
  • Firstpage
    224
  • Lastpage
    230
  • Abstract
    The authors propose a method for capturing speaker-invariant features from speech spectrograms using artificial neural network (ANN) models. Feature extraction was carried out using the recognition network model, a biologically based pattern recognition system capable of recognizing images that are distorted or shifted in position. It is a three-layer network system. The initial layer of the network extracts small features; each advancing layer looks for larger and larger features. As pattern information progresses through the network, slight distortions and shifts in position are allowed. The proposed network model was used to learn vowel features from six different vowel and diphthong sounds in English. Initial test results show that the network model is capable of learning all important features that are present in the pattern studied
  • Keywords
    neural nets; speech analysis and processing; speech recognition; artificial neural network; biologically based pattern recognition system; diphthong sounds; feature extraction; multi-layered network models; pattern information; recognition network model; speaker-invariant features; speech spectrograms; vowel features; Artificial neural networks; Automatic speech recognition; Computer networks; Feature extraction; Hidden Markov models; Image processing; Layout; Prototypes; Spectrogram; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools for Artificial Intelligence, 1989. Architectures, Languages and Algorithms, IEEE International Workshop on
  • Conference_Location
    Fairfax, VA
  • Print_ISBN
    0-8186-1984-8
  • Type

    conf

  • DOI
    10.1109/TAI.1989.65324
  • Filename
    65324