• DocumentCode
    1801152
  • Title

    Developing a method to build Japanese speech recognition system based on 3-gram language model expansion with Google database

  • Author

    Shimada, Toshiaki ; Nisimura, Ryuichi ; Tanaka, Masayasu ; Kawahara, Hideki ; Irino, Toshio

  • Author_Institution
    Faculty of Systems Engineering, Wakayama University, 640-8510, Japan
  • fYear
    2013
  • fDate
    1-8 Jan. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    We have developed a method to build a Japanese automatic speech recognition (ASR) system based on 3-gram language model expansion with the Google database. Our aim is to enhance the recognition accuracy of ASR systems based on the 3-gram language model, even in cases where the language model is trained using short text segments. We investigate a practical approach to expanding language models by using 3-gram information from external web documents. In addition, we filter 3-gram entries on the basis of term frequency-inverse document frequency (TF-IDF) scores and the output of the Yahoo! web API to prevent the unnecessary addition of redundant or irrelevant 3-gram entries. In the experiments, we achieved an improvement of 0.71% in the word error rate and proved that the recognition accuracy can be improved by combining the proposed method and the traditional back-off smoothing technique without any costs being incurred in collecting additional text for training the model.
  • Keywords
    Accuracy; Databases; Google; Hidden Markov models; Smoothing methods; Speech recognition; Training; Google database; Language modeling; Speech recognition system;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Conference Anthology, IEEE
  • Conference_Location
    China
  • Type

    conf

  • DOI
    10.1109/ANTHOLOGY.2013.6784781
  • Filename
    6784781