• DocumentCode
    3631368
  • Title

    Neural network based language models for highly inflective languages

  • Author

    Tomas Mikolov;Jiri Kopecky;Lukas Burget;Ondrej Glembek;Jan ?Cernocky

  • Author_Institution
    Speech@FIT, Faculty of Information Technology, Brno University of Technology, Czech Republic
  • fYear
    2009
  • Firstpage
    4725
  • Lastpage
    4728
  • Abstract
    Speech recognition of inflectional and morphologically rich languages like Czech is currently quite a challenging task, because simple n-gram techniques are unable to capture important regularities in the data. Several possible solutions were proposed, namely class based models, factored models, decision trees and neural networks. This paper describes improvements obtained in recognition of spoken Czech lectures using language models based on neural networks. Relative reductions in word error rate are more than 15% over baseline obtained with adapted 4-gram backoff language model using modified Kneser-Ney smoothing.
  • Keywords
    "Neural networks","Natural languages","Speech recognition","Neurons","Probability distribution","Clustering algorithms","Information technology","Decision trees","Error analysis","Smoothing methods"
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    2379-190X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960686
  • Filename
    4960686