DocumentCode :
2029518
Title :
Disambiguation for polyphones of Chinese based on two-pass unified approach
Author :
Huang, Feng-Long ; Lin, Jun-Hong ; Lin, Xin-Wei
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. United Univ., Miaoli, Taiwan
fYear :
2010
fDate :
16-18 Dec. 2010
Firstpage :
603
Lastpage :
607
Abstract :
The paper addresses the issue of Chinese polyphones and the disambiguity approach. Three methods, dictionary matching, language models and voting scheme, are used to retrieve the token´s information and then disambiguate the prediction of polyphones. The best precision for these methods achieves 92.72%. Furthermore we proposed the two-pass unified approach to improve the performance with various empirical thresholds. Our approach is superior to the well-known MS Word 2007, and the final precision rate reaches 94.3%. It proves that the proposed approach can resolve effectively text-to-phoneme conversion in Chinese TTS system.
Keywords :
Internet; learning (artificial intelligence); natural language processing; speech processing; speech synthesis; Chinese TTS system; Chinese polyphones; MS Word 2007; dictionary matching; disambiguation; language models; text-to-phoneme conversion; text-to-speech; token´s information; two-pass unified approach; voting scheme; Computational modeling; Dictionaries; Electronic learning; Markov processes; Probability; Semantics; Testing; Language Model; Text-to-Speech(TTS); Two-Pass Unified Approach; Word Sense Disambiguation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Symposium (ICS), 2010 International
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-7639-8
Type :
conf
DOI :
10.1109/COMPSYM.2010.5685440
Filename :
5685440
Link To Document :
بازگشت