Title :
MLP emulation of N-gram models as a first step to connectionist language modeling
Author :
Castro, Maria Jose ; Prat, F. ; Casacuberta, Francisco
Author_Institution :
Dept. de Sistemas Inf. y Comput., Univ. Politecnica de Valencia, Spain
Abstract :
In problems such as automatic speech recognition and machine translation, where the system response must be a sentence in a given language, language models are employed in order to improve system performance. These language models are usually N-gram models (for instance, bigram or trigram models) which are estimated from large text databases using the occurrence frequencies of these N-grams. Nakamura and Shikano (1989) empirically showed how multilayer perceptrons can emulate trigram model predictive capabilities with additional generalization features. Our paper discusses Nakamura and Shikano´s work, provides new empirical evidence on multilayer perceptron capability to emulate N-gram models, and proposes new directions for extending neural network-based language models. The experimental work we present here compares connectionist phonological bigram models with a conventional one using different measures, which include recognition performances in a Spanish acoustic-phonetic decoding task
Keywords :
speech recognition; N-gram models; Spanish acoustic-phonetic decoding task; automatic speech recognition; bigram models; connectionist language modeling; connectionist phonological bigram models; generalization features; language models; large text databases; machine translation; neural network-based language models; occurrence frequencies; predictive capabilities; recognition performances; trigram models;
Conference_Titel :
Artificial Neural Networks, 1999. ICANN 99. Ninth International Conference on (Conf. Publ. No. 470)
Conference_Location :
Edinburgh
Print_ISBN :
0-85296-721-7
DOI :
10.1049/cp:19991228