• DocumentCode
    3494331
  • Title

    MLP emulation of N-gram models as a first step to connectionist language modeling

  • Author

    Castro, Maria Jose ; Prat, F. ; Casacuberta, Francisco

  • Author_Institution
    Dept. de Sistemas Inf. y Comput., Univ. Politecnica de Valencia, Spain
  • Volume
    2
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    910
  • Abstract
    In problems such as automatic speech recognition and machine translation, where the system response must be a sentence in a given language, language models are employed in order to improve system performance. These language models are usually N-gram models (for instance, bigram or trigram models) which are estimated from large text databases using the occurrence frequencies of these N-grams. Nakamura and Shikano (1989) empirically showed how multilayer perceptrons can emulate trigram model predictive capabilities with additional generalization features. Our paper discusses Nakamura and Shikano´s work, provides new empirical evidence on multilayer perceptron capability to emulate N-gram models, and proposes new directions for extending neural network-based language models. The experimental work we present here compares connectionist phonological bigram models with a conventional one using different measures, which include recognition performances in a Spanish acoustic-phonetic decoding task
  • Keywords
    speech recognition; N-gram models; Spanish acoustic-phonetic decoding task; automatic speech recognition; bigram models; connectionist language modeling; connectionist phonological bigram models; generalization features; language models; large text databases; machine translation; neural network-based language models; occurrence frequencies; predictive capabilities; recognition performances; trigram models;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Artificial Neural Networks, 1999. ICANN 99. Ninth International Conference on (Conf. Publ. No. 470)
  • Conference_Location
    Edinburgh
  • ISSN
    0537-9989
  • Print_ISBN
    0-85296-721-7
  • Type

    conf

  • DOI
    10.1049/cp:19991228
  • Filename
    818053