Developing a method to build Japanese speech recognition system based on 3-gram language model expansion with Google database

Author

Shimada, Toshiaki ; Nisimura, Ryuichi ; Tanaka, Masayasu ; Kawahara, Hideki ; Irino, Toshio

Author_Institution

Faculty of Systems Engineering, Wakayama University, 640-8510, Japan

fYear

2013

fDate

1-8 Jan. 2013

Firstpage

1

Lastpage

6

Abstract

We have developed a method to build a Japanese automatic speech recognition (ASR) system based on 3-gram language model expansion with the Google database. Our aim is to enhance the recognition accuracy of ASR systems based on the 3-gram language model, even in cases where the language model is trained using short text segments. We investigate a practical approach to expanding language models by using 3-gram information from external web documents. In addition, we filter 3-gram entries on the basis of term frequency-inverse document frequency (TF-IDF) scores and the output of the Yahoo! web API to prevent the unnecessary addition of redundant or irrelevant 3-gram entries. In the experiments, we achieved an improvement of 0.71% in the word error rate and proved that the recognition accuracy can be improved by combining the proposed method and the traditional back-off smoothing technique without any costs being incurred in collecting additional text for training the model.

Keywords

Accuracy; Databases; Google; Hidden Markov models; Smoothing methods; Speech recognition; Training; Google database; Language modeling; Speech recognition system;

fLanguage

English

Publisher

ieee

Conference_Titel

Conference Anthology, IEEE

Conference_Location

China

Type

conf

DOI

10.1109/ANTHOLOGY.2013.6784781

Filename

6784781