• DocumentCode
    2646382
  • Title

    Time-warping neural network for phoneme recognition

  • Author

    Aikawa, Kiyoaki

  • Author_Institution
    NTT Human Interface Lab., Tokyo, Japan
  • fYear
    1991
  • fDate
    18-21 Nov 1991
  • Firstpage
    2122
  • Abstract
    The author investigates a feedforward neural network that can accept phonemes with an arbitrary duration coping with nonlinear time warping. The time-warping neural network is characterized by the time-warping functions embedded between the input layer and the first hidden layer in the network. The input layer accesses three different time points. The accessing points are determined by the time-warping functions. The input spectrum sequence itself is not warped but the accessing-point sequence is warped. The advantage of this network architecture is that the input layer can access the original spectrum sequence. The proposed network demonstrated higher phoneme recognition accuracy than the baseline recognizer based on conventional feedforward neural networks. The recognition accuracy was even higher than that achieved with discrete hidden Markov models
  • Keywords
    neural nets; speech recognition; accessing points; feedforward neural network; network architecture; nonlinear time warping; phoneme recognition; spectrum sequence; speech recognition; Dynamic programming; Feedforward neural networks; Feedforward systems; Heuristic algorithms; Hidden Markov models; Humans; Laboratories; Neural networks; Robustness; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1991. 1991 IEEE International Joint Conference on
  • Print_ISBN
    0-7803-0227-3
  • Type

    conf

  • DOI
    10.1109/IJCNN.1991.170701
  • Filename
    170701