DocumentCode :
2189838
Title :
Categorial-grammar-based phrase break prediction
Author :
Saychum, S. ; Hansakunbuntheung, C. ; Thatphithakkul, N. ; Ruangrajitpakorn, T. ; Wutiwiwatchai, C. ; Supnithi, T. ; Chotimongkol, A. ; Thangthai, A.
Author_Institution :
Nat. Electron. & Comput. Technol. Center, Pathumthani, Thailand
fYear :
2011
fDate :
17-19 May 2011
Firstpage :
954
Lastpage :
957
Abstract :
Part-of-speech (POS) tags have been widely used as the main feature for predicting phrase breaks in text-to-speech synthesis (TTS) systems. However, POS does not clearly represent the syntactic information needed to analyze the grammatical tree structure of a language and assign phrase breaks. Instead of POS, this paper proposes using categorial grammar (CG), which embeds fine-grained syntactic information, as the key feature for predicting phrase breaks in Thai texts. The performance of phrase break prediction using CG, POS, and their reduced sets is compared, with a classification and regression tree (CART) used to learn and predict phrase break locations. The experimental results showed that phrase break prediction using CG as the main feature gave the best performance among the tested features (Precision = 73.15%, Recall = 96.96%, F-measure = 83.39%).
Keywords :
grammars; regression analysis; speech synthesis; trees (mathematics); Thai texts; categorial-grammar; classification and regression tree; part-of-speech; phrase break prediction; text-to-speech synthesis systems; Accuracy; Data models; Helium; Presses; Syntactics; Training
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), 2011 8th International Conference on
Conference_Location :
Khon Kaen
Print_ISBN :
978-1-4577-0425-3
Type :
conf
DOI :
10.1109/ECTICON.2011.5948000
Filename :
5948000