• DocumentCode
    1009764
  • Title

    Automatic recognition of pitch movements using multilayer perceptron and time-Delay Recursive neural network

  • Author

    Kim, Sung-Sunk ; Hasegawa-Johnson, Mark ; Chen, Ken

  • Author_Institution
    Yong-In Univ., Seoul, South Korea
  • Volume
    11
  • Issue
    7
  • fYear
    2004
  • fDate
    7/1/2004 12:00:00 AM
  • Firstpage
    645
  • Lastpage
    648
  • Abstract
    This letter demonstrates hidden Markov model (HMM), multilayer perceptron (MLP), and time-delay recursive neural network (TDRNN) architectures for the purpose of recognizing pitch accents given observation of the F0 and energy trajectories. At an insertion error rate of 25%, the deletion error rates of the MLP, TDRNN, and HMM are 13.2%, 7.9%, and 32.7%, respectively, despite the fact that both MLP and TDRNN have 70% fewer trainable parameters than the HMM. Error analysis suggests that low-pitch accents may require long-term context to correctly recognize, while high-pitch accents may be recognizable based on local pitch contour.
  • Keywords
    error analysis; feedforward neural nets; hidden Markov models; multilayer perceptrons; natural language interfaces; speech recognition; HMM; MLP; TDRNN; error analysis; feedforward neural networks; hidden Markov model; local pitch contour; low-pitch accent; multilayer perceptron; natural language interfaces; speech recognition; time-delay recursive neural network; Error analysis; Frequency; Hidden Markov models; Multi-layer neural network; Multilayer perceptrons; Natural languages; Neural networks; Recurrent neural networks; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2004.830114
  • Filename
    1306484