Title :
A neural fuzzy training approach for continuous speech recognition improvement
Author :
Komori, Yasuhiro
Author_Institution :
ATR Interpreting Telephony Res. Lab., Kyoto, Japan
Abstract :
A novel training method for phoneme identification neural networks, called the neural fuzzy training method, is proposed. It differs from the conventional method in that the target values of each training sample are given as fuzzy phoneme class information instead of discrete phoneme class information. In the conventional training method, the target values are defined as 0s or 1s. In the proposed method, the target values are defined as likelihoods of the phoneme classes, taking values between 0 and 1. For each phoneme class, this likelihood is computed by a likelihood transformation function from the distance between the input sample and the nearest training-set sample belonging to that class. The effectiveness of the proposed method is shown by an 18-consonant identification experiment and a continuous speech recognition experiment using the ATR isolated word and phrase database. Improvements are observed in every experiment, particularly in the continuous speech recognition results.
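The target computation described in the abstract can be sketched as follows. This is a minimal illustration, not the author's implementation: the abstract does not specify the likelihood transformation function or the distance measure, so the exponential decay `exp_transform`, its `alpha` parameter, the Euclidean distance, and the toy two-dimensional samples are all assumptions made here for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two feature vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def exp_transform(distance, alpha=1.0):
    """Hypothetical likelihood transformation: maps a distance >= 0 into (0, 1].

    The abstract does not give the actual function; exponential decay is
    only one plausible monotonically decreasing choice.
    """
    return math.exp(-alpha * distance)


def fuzzy_targets(sample, train_samples, train_labels, classes,
                  transform=exp_transform):
    """Return one target value per class for `sample`.

    For each class, find the nearest training sample belonging to that
    class and pass its distance through the transformation function.
    """
    targets = []
    for cls in classes:
        distances = [euclidean(sample, s)
                     for s, label in zip(train_samples, train_labels)
                     if label == cls]
        # A class with no training samples gets likelihood 0 (assumption).
        targets.append(transform(min(distances)) if distances else 0.0)
    return targets


# Toy data: three made-up 2-D "phoneme" samples, one per class.
train_samples = [(0.0, 0.0), (1.0, 0.0), (4.0, 0.0)]
train_labels = ["b", "d", "g"]
classes = ["b", "d", "g"]

# Targets for the first training sample (true class "b").
targets = fuzzy_targets(train_samples[0], train_samples, train_labels, classes)
print(targets)
```

Under these assumptions a training sample keeps a target of 1 for its own class, because its nearest same-class sample is itself at distance 0, while nearby classes receive non-zero targets instead of the hard 0 used in conventional training.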
Keywords :
backpropagation; fuzzy logic; neural nets; speech recognition; 18-consonant identification experiment; continuous speech recognition; fuzzy phoneme class information; input sample; isolated word and phrase database; likelihood transformation function; nearest sample; neural fuzzy training method; phoneme identification neural networks; target values; training method; training sample; Databases; Fuzzy neural networks; Laboratories; Natural languages; Neural networks; Noise robustness; Smoothing methods; Speech recognition; Telephony; Testing;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225886