DocumentCode
1009764
Title
Automatic recognition of pitch movements using multilayer perceptron and time-Delay Recursive neural network
Author
Kim, Sung-Sunk ; Hasegawa-Johnson, Mark ; Chen, Ken
Author_Institution
Yong-In Univ., Seoul, South Korea
Volume
11
Issue
7
fYear
2004
fDate
7/1/2004 12:00:00 AM
Firstpage
645
Lastpage
648
Abstract
This letter demonstrates hidden Markov model (HMM), multilayer perceptron (MLP), and time-delay recursive neural network (TDRNN) architectures for the purpose of recognizing pitch accents given observation of the F0 and energy trajectories. At an insertion error rate of 25%, the deletion error rates of the MLP, TDRNN, and HMM are 13.2%, 7.9%, and 32.7%, respectively, despite the fact that both MLP and TDRNN have 70% fewer trainable parameters than the HMM. Error analysis suggests that low-pitch accents may require long-term context to correctly recognize, while high-pitch accents may be recognizable based on local pitch contour.
Keywords
error analysis; feedforward neural nets; hidden Markov models; multilayer perceptrons; natural language interfaces; speech recognition; HMM; MLP; TDRNN; error analysis; feedforward neural networks; hidden Markov model; local pitch contour; low-pitch accent; multilayer perceptron; natural language interfaces; speech recognition; time-delay recursive neural network; Error analysis; Frequency; Hidden Markov models; Multi-layer neural network; Multilayer perceptrons; Natural languages; Neural networks; Recurrent neural networks; Speech analysis; Speech recognition;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2004.830114
Filename
1306484
Link To Document