DocumentCode
3167453
Title
Japanese and Korean voice search
Author
Schuster, Mike ; Nakajima, Kaisuke
Author_Institution
Google Inc., Mountain View, CA, USA
fYear
2012
fDate
25-30 March 2012
Firstpage
5149
Lastpage
5152
Abstract
This paper describes challenges and solutions for building a successful voice search system as applied to Japanese and Korean at Google. We describe the techniques used to deal with an infinite vocabulary, how modeling completely in the written domain for language model and dictionary can avoid some system complexity, and how we built dictionaries, language and acoustic models in this framework. We show how to deal with the difficulty of scoring results for multiple script languages because of ambiguities. The development of voice search for these languages led to a significant simplification of the original process to build a system for any new language which in in parts became our default process for internationalization of voice search.
Keywords
natural language processing; speech recognition; Japanese voice search system; Korean voice search system; acoustic models; dictionary; language model; multiple script languages; speech recognition; Decision support systems; Helium; Japanese; Korean; Speech recognition; voice search;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6289079
Filename
6289079
Link To Document