Title :
Parallelism, hierarchy, scaling in time-delay neural networks for spotting Japanese phonemes CV-syllables
Author :
Sawai, Hidefumi ; Waibei ; Haffner, Patrick ; Miyatake, Masanori ; Shikano, Kiyohiro
Abstract :
To extend the performance of TDNNs (time-delay neural networks) to all phoneme recognition and word/continuous speech recognition, the authors present several techniques. First, they show that it is possible to scale up the TDNN to a large phonemic TDNN aimed at discriminating all phonemes without loss of recognition performance and without excessive training tokens. Second, the authors propose fast backpropagation learning methods which make it possible to train a large phonemic TDNN within 1.5 hours. Finally, they show several methods for spotting Japanese CV syllables/phonemes in input speech based on TDNNs: they constructed a TDNN which can discriminate a single CV syllable or phoneme. Syllable and phoneme spotting experiments show excellent results, with syllable and phoneme spotting rates of better than 96.7% and 92% correct, respectively.<>
Keywords :
computerised pattern recognition; learning systems; neural nets; parallel architectures; speech recognition; Japanese CV syllables; Japanese phonemes; continuous speech recognition; fast backpropagation learning methods; hierarchy; parallelism; scaling; time-delay neural networks; word recognition; Learning systems; Neural networks; Parallel architectures; Pattern recognition; Speech recognition;
Conference_Titel :
Neural Networks, 1989. IJCNN., International Joint Conference on
Conference_Location :
Washington, DC, USA
DOI :
10.1109/IJCNN.1989.118682