Title :
Speech recognition by combining pairwise discriminant time-delay neural networks and predictive LR-parser
Author :
Takami, Jun-Ichi ; Kai, Atsuhiko ; Sagayama, Shigeki
Author_Institution :
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
fDate :
30 Sep-1 Oct 1991
Abstract :
A phoneme recognition method using pairwise discriminant time-delay neural networks (PD-TDNNs) and a continuous speech recognition method using the PD-TDNNs are proposed. It is shown that classification-type neural networks have poor robustness against the difference in speaking rates between training data and testing data. To improve the robustness, the authors developed a phoneme recognition method using PD-TDNNs. This method has high performance owing to its particular mechanism, that is a majority decision by multiple less sharp discrimination boundaries. They tested these methods on both consonant recognition and phrase recognition, and obtained higher recognition performance compared with a conventional method using a single TDNN
Keywords :
delays; learning (artificial intelligence); neural nets; speech recognition; AI; consonant recognition; continuous speech recognition; majority decision; multiple less sharp discrimination boundaries; pairwise discriminant time-delay neural networks; performance; phoneme recognition; phrase recognition; robustness; speaking rates; Artificial neural networks; Computer networks; Data mining; Laboratories; Neural networks; Robustness; Speech recognition; Telephony; Testing; Training data;
Conference_Titel :
Neural Networks for Signal Processing [1991]., Proceedings of the 1991 IEEE Workshop
Conference_Location :
Princeton, NJ
Print_ISBN :
0-7803-0118-8
DOI :
10.1109/NNSP.1991.239509