Title :
Expanding the vocabulary of a connectionist recognizer trained on the DARPA Resource Management corpus
Author :
Lucke, H. ; Fallside, F.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
It is shown how the compositional representation (CR) previously used for lexical access from sub-word recognizers for a relatively small word vocabulary can be extended to much larger vocabularies without further training. This is demonstrated for the DARPA Resource Management database where, using sub-word units as input, words are represented distributively over a fixed number of units and classified using a simple network. Initially, the architecture is trained on 147 words, achieving an accuracy of 91.2%. Then, leaving the recognizer unchanged, it is shown how additional output units can be added to the network to increase the vocabulary to the complete set of 975 phonetically distinct words. On this extended vocabulary the performance dropped to 66%, but this drop is less than the drop expected from the increase in perplexity. Further improvement would be achieved by improving the performance on the original data set.
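The idea of growing the vocabulary by adding output units while leaving the recognizer untouched can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the tiny phone inventory `PHONES`, the position-weighted `encode` function standing in for the compositional representation, and the `WordClassifier` class, whose output weights are set directly from pronunciations rather than trained.

```python
import math

# Hypothetical sub-word inventory (assumption, not the paper's unit set).
PHONES = ["k", "ae", "t", "d", "ao", "g", "s", "ih"]


def encode(phones):
    """Map a phone sequence to a fixed-length vector.

    Assumed stand-in for a compositional representation: each phone adds
    1 / (position + 1) to its own unit, so the vector length never depends
    on the word length.
    """
    vec = [0.0] * len(PHONES)
    for pos, phone in enumerate(phones):
        vec[PHONES.index(phone)] += 1.0 / (pos + 1)
    return vec


def normalize(vec):
    """Scale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


class WordClassifier:
    """Single-layer classifier with one output unit per vocabulary word."""

    def __init__(self):
        self.words = []    # label of each output unit
        self.weights = []  # weight vector of each output unit

    def add_word(self, word, phones):
        """Append one output unit; existing units are left unchanged."""
        self.words.append(word)
        self.weights.append(normalize(encode(phones)))

    def classify(self, phones):
        """Return the word whose output unit has the highest activation."""
        x = normalize(encode(phones))
        scores = [sum(w * v for w, v in zip(unit, x)) for unit in self.weights]
        return self.words[scores.index(max(scores))]


# Start with a small vocabulary, then extend it without touching old units.
clf = WordClassifier()
clf.add_word("cat", ["k", "ae", "t"])
clf.add_word("dog", ["d", "ao", "g"])
clf.add_word("sit", ["s", "ih", "t"])  # vocabulary expansion step
```

Because the input encoding has a fixed size, each new word costs only one extra weight vector, which is the property that lets the vocabulary grow without retraining the earlier stages in this sketch.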
Keywords :
learning (artificial intelligence); neural nets; speech recognition equipment; vocabulary; DARPA Resource Management corpus; accuracy; compositional representation; connectionist recognizer; performance; perplexity; subword units; training; vocabulary expansion; Chromium; Data structures; Databases; Impedance; Neural networks; Recurrent neural networks; Resource management; Speech processing; Speech recognition; Vocabulary;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225836