DocumentCode
2909689
Title
Invited Talks
Author
Furui, S.
Author_Institution
Tokyo Inst. of Technol., Tokyo, Japan
fYear
2011
fDate
15-17 Nov. 2011
Abstract
More than 6000 living languages are spoken in the world today, and the majority of them are concentrating in Asia. Every language has its own specific acoustic as well as linguistic characteristics that require special modeling techniques. This talk presents our recent experiences in regard to building automatic speech recognition (ASR) systems for the Indonesian, Thai and Chinese languages. For Indonesian, we are building a spoken-query information retrieval (IR) system. In order to solve the problem of a large variation of proper noun and English word pronunciation, we have applied proper noun-specific adaptation in acoustic modeling and rule-based English- to-Indonesian phoneme mapping. For Thai, since there is no word boundary in the written form, we have proposed a new method for automatically creating word-like units from a text corpus, and to recognize spoken style utterances we have applied topic and speaking style adaptation to the language model. In spoken Chinese, long organization names are often abbreviated, and abbreviated utterances cannot be recognized if the abbreviations are not included in the dictionary. We have proposed a new method for automatically generating Chinese abbreviations, and by expanding the vocabulary using the generated abbreviations, we have significantly improved the performance of voice search. This talk includes several recent research activities for the Japanese language.
Keywords
natural language processing; speech recognition; ASR research; Asian languages; English word pronunciation; acoustic modeling; automatic speech recognition systems; linguistic characteristics; special modeling techniques; spoken query information retrieval system;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location
Penang
Print_ISBN
978-1-4577-1733-8
Type
conf
DOI
10.1109/IALP.2011.9
Filename
6121453
Link To Document