Title :
Automatic recognition of pitch movements using multilayer perceptron and time-Delay Recursive neural network
Author :
Kim, Sung-Sunk ; Hasegawa-Johnson, Mark ; Chen, Ken
Author_Institution :
Yong-In Univ., Seoul, South Korea
fDate :
7/1/2004 12:00:00 AM
Abstract :
This letter demonstrates hidden Markov model (HMM), multilayer perceptron (MLP), and time-delay recursive neural network (TDRNN) architectures for the purpose of recognizing pitch accents given observation of the F0 and energy trajectories. At an insertion error rate of 25%, the deletion error rates of the MLP, TDRNN, and HMM are 13.2%, 7.9%, and 32.7%, respectively, despite the fact that both MLP and TDRNN have 70% fewer trainable parameters than the HMM. Error analysis suggests that low-pitch accents may require long-term context to correctly recognize, while high-pitch accents may be recognizable based on local pitch contour.
Keywords :
error analysis; feedforward neural nets; hidden Markov models; multilayer perceptrons; natural language interfaces; speech recognition; HMM; MLP; TDRNN; error analysis; feedforward neural networks; hidden Markov model; local pitch contour; low-pitch accent; multilayer perceptron; natural language interfaces; speech recognition; time-delay recursive neural network; Error analysis; Frequency; Hidden Markov models; Multi-layer neural network; Multilayer perceptrons; Natural languages; Neural networks; Recurrent neural networks; Speech analysis; Speech recognition;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2004.830114