• DocumentCode
    3167453
  • Title

    Japanese and Korean voice search

  • Author

    Schuster, Mike ; Nakajima, Kaisuke

  • Author_Institution
    Google Inc., Mountain View, CA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    5149
  • Lastpage
    5152
  • Abstract
    This paper describes challenges and solutions for building a successful voice search system as applied to Japanese and Korean at Google. We describe the techniques used to deal with an infinite vocabulary, how modeling completely in the written domain for language model and dictionary can avoid some system complexity, and how we built dictionaries, language and acoustic models in this framework. We show how to deal with the difficulty of scoring results for multiple script languages because of ambiguities. The development of voice search for these languages led to a significant simplification of the original process to build a system for any new language which in in parts became our default process for internationalization of voice search.
  • Keywords
    natural language processing; speech recognition; Japanese voice search system; Korean voice search system; acoustic models; dictionary; language model; multiple script languages; speech recognition; Decision support systems; Helium; Japanese; Korean; Speech recognition; voice search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6289079
  • Filename
    6289079