Title :
A recurrent time-delay neural network for improved phoneme recognition
Author :
Greco, Fabio ; Paoloni, Andrea ; Ravaioli, G.
Author_Institution :
Fondazione Ugo Bordoni, Roma, Italy
Abstract :
The authors propose a modification to the structure of the time-delay neural network (TDNN), obtained through feedback at the first-hidden layer level. The experiment carried out with the new model, called RTDNN (recurrent TDNN), consists of the classification of the unvoiced plosive phonemes. These were extracted from an initial and intermediate position in a list of the most common Italian words, uttered by a male speaker, thus obtaining 250 tokens per phoneme. The training was carried out through a modified variant of back propagation, known as BPS (back propagation for sequences), using half of the tokens for learning and the remaining for the test. The error rate trend thus obtained shows a 27% decrease in a particular range of the magnitude of feedback, with values ranging from 5% for the original TDNN model with no feedback to 3.6% for the proposed RTDNN model
Keywords :
delays; neural nets; speech recognition; RTDNN; back propagation; error rate; feedback; first-hidden layer level; male speaker; most common Italian words; phoneme recognition; recurrent time-delay neural network; speech recognition; unvoiced plosive phonemes; Data mining; Error analysis; Neural networks; Neurofeedback; Pattern classification; Recurrent neural networks; Shape; Signal analysis; Speech recognition; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150283