DocumentCode :
1801152
Title :
Developing a method to build Japanese speech recognition system based on 3-gram language model expansion with Google database
Author :
Shimada, Toshiaki ; Nisimura, Ryuichi ; Tanaka, Masayasu ; Kawahara, Hideki ; Irino, Toshio
Author_Institution :
Faculty of Systems Engineering, Wakayama University, 640-8510, Japan
fYear :
2013
fDate :
1-8 Jan. 2013
Firstpage :
1
Lastpage :
6
Abstract :
We have developed a method to build a Japanese automatic speech recognition (ASR) system based on 3-gram language model expansion with the Google database. Our aim is to enhance the recognition accuracy of ASR systems based on the 3-gram language model, even in cases where the language model is trained using short text segments. We investigate a practical approach to expanding language models by using 3-gram information from external web documents. In addition, we filter 3-gram entries on the basis of term frequency-inverse document frequency (TF-IDF) scores and the output of the Yahoo! web API to prevent the unnecessary addition of redundant or irrelevant 3-gram entries. In the experiments, we achieved an improvement of 0.71% in the word error rate and proved that the recognition accuracy can be improved by combining the proposed method and the traditional back-off smoothing technique without any costs being incurred in collecting additional text for training the model.
Keywords :
Accuracy; Databases; Google; Hidden Markov models; Smoothing methods; Speech recognition; Training; Google database; Language modeling; Speech recognition system;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Conference Anthology, IEEE
Conference_Location :
China
Type :
conf
DOI :
10.1109/ANTHOLOGY.2013.6784781
Filename :
6784781
Link To Document :
بازگشت