DocumentCode
3631368
Title
Neural network based language models for highly inflective languages
Author
Tomas Mikolov;Jiri Kopecky;Lukas Burget;Ondrej Glembek;Jan ?Cernocky
Author_Institution
Speech@FIT, Faculty of Information Technology, Brno University of Technology, Czech Republic
fYear
2009
Firstpage
4725
Lastpage
4728
Abstract
Speech recognition of inflectional and morphologically rich languages like Czech is currently quite a challenging task, because simple n-gram techniques are unable to capture important regularities in the data. Several possible solutions were proposed, namely class based models, factored models, decision trees and neural networks. This paper describes improvements obtained in recognition of spoken Czech lectures using language models based on neural networks. Relative reductions in word error rate are more than 15% over baseline obtained with adapted 4-gram backoff language model using modified Kneser-Ney smoothing.
Keywords
"Neural networks","Natural languages","Speech recognition","Neurons","Probability distribution","Clustering algorithms","Information technology","Decision trees","Error analysis","Smoothing methods"
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
2379-190X
Type
conf
DOI
10.1109/ICASSP.2009.4960686
Filename
4960686
Link To Document