DocumentCode :
1264358
Title :
A novel objective function for improved phoneme recognition using time-delay neural networks
Author :
Hampshire, John B., II ; Waibel, Alexander H.
Author_Institution :
Carnegie-Mellon Univ., Pittsburgh, PA, USA
Volume :
1
Issue :
2
fYear :
1990
fDate :
6/1/1990 12:00:00 AM
Firstpage :
216
Lastpage :
228
Abstract :
Single-speaker and multispeaker recognition results are presented for the voice-stop consonants /b,d,g/ using time-delay neural networks (TDNNs) with a number of enhancements, including a new objective function for training these networks. The new objective function, called the classification figure of merit (CFM), differs markedly from the traditional mean-squared-error (MSE) objective function and the related cross entropy (CE) objective function. Where the MSE and CE objective functions seek to minimize the difference between each output node and its ideal activation, the CFM function seeks to maximize the difference between the output activation of the node representing incorrect classifications. A simple arbitration mechanism is used with all three objective functions to achieve a median 30% reduction in the number of misclassifications when compared to TDNNs trained with the traditional MSE back-propagation objective function alone
Keywords :
delays; neural nets; speech recognition; classification figure of merit; cross entropy; mean-squared-error; objective function; phoneme recognition; time-delay neural networks; voice-stop consonants; Computer science; Costs; Entropy; Error analysis; Laboratories; Neural networks; Speech recognition; Statistics;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/72.80233
Filename :
80233
Link To Document :
بازگشت