DocumentCode
2029518
Title
Disambiguation for polyphones of Chinese based on two-pass unified approach
Author
Huang, Feng-Long ; Lin, Jun-Hong ; Lin, Xin-Wei
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. United Univ., Miaoli, Taiwan
fYear
2010
fDate
16-18 Dec. 2010
Firstpage
603
Lastpage
607
Abstract
This paper addresses the problem of Chinese polyphones and approaches to their disambiguation. Three methods (dictionary matching, language models, and a voting scheme) are used to retrieve each token's information and then disambiguate the predicted pronunciation of polyphones. The best precision achieved by these methods is 92.72%. Furthermore, we propose a two-pass unified approach that improves performance using various empirical thresholds. Our approach outperforms the well-known MS Word 2007, and the final precision rate reaches 94.3%. The results indicate that the proposed approach can effectively resolve text-to-phoneme conversion in Chinese TTS systems.
Keywords
Internet; learning (artificial intelligence); natural language processing; speech processing; speech synthesis; Chinese TTS system; Chinese polyphones; MS Word 2007; dictionary matching; disambiguation; language models; text-to-phoneme conversion; text-to-speech; token's information; two-pass unified approach; voting scheme; Computational modeling; Dictionaries; Electronic learning; Markov processes; Probability; Semantics; Testing; Language Model; Text-to-Speech (TTS); Two-Pass Unified Approach; Word Sense Disambiguation
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Symposium (ICS), 2010 International
Conference_Location
Tainan
Print_ISBN
978-1-4244-7639-8
Type
conf
DOI
10.1109/COMPSYM.2010.5685440
Filename
5685440