Title :
Develop a HMM-based Taiwanese text-to-speech system
Author :
Sher, Yung-Ji ; Hsu, Ming-Chun ; Chiu, Yu-Hsien ; Chung, Kao-Chi
Author_Institution :
Dept. of Special Educ., Nat. Taiwan Normal Univ., Taipei, Taiwan
Abstract :
In Taiwan, speech technology development has been focused on Mandarin system; there is lack of related researches of domestic Taiwanese speech. This study is aimed to develop a HMM-based Taiwanese speech synthesis system, and the established sub-syllables units´ database is used to be the synthesis units for all of the possible Taiwanese semantic syllables. The multiple accent corpus-based databases were developed by all combination of basic phonemes of vowels, consonants and 8-tones in Modern Literal Taiwanese (MLT) in this study. Based on the phoneme tables, this research provide fundamental database for speech analysis and synthesis in Taiwanese spelling systems. The principles and procedures in constructing phonetically balanced sentences are constituted by Taiwanese grammatical in syntax and semantics, and employ all the possible Taiwanese. Pattern recognition was applied to extract features´ parameter codes including formants, pitch, amplitudes, and duration. The collected text corpus consists of MLT sentences and syllables from MLT books. A Taiwanese balanced sentences speech database including MLT sentences is established through training and analyzing system developed by windows programming, and another rare phoneme units are generated to be included in the database. The phonetic set of Taiwanese tonal phonemes is generated from the HTK recognition results. Through this research, the established TTS system may contribute to the education and training of native Taiwanese for the children, elder, and speech disabled.
Keywords :
feature extraction; hidden Markov models; natural language processing; speech processing; speech synthesis; spelling aids; text analysis; HMM-based Taiwanese text-to-speech synthesis system; Mandarin system; Taiwanese semantic syllables; Taiwanese spelling system; corpus based database; feature parameter codes extraction; hidden Markov model; modern literal Taiwanese; pattern recognition; speech analysis; speech technology development; windows programming; Databases; Hidden Markov models; Robustness; Speech; Speech synthesis; Training; Hidden Markov Model (HMM); Taiwanese; Text-to-speech (TTS) System;
Conference_Titel :
Software Technology and Engineering (ICSTE), 2010 2nd International Conference on
Conference_Location :
San Juan, PR
Print_ISBN :
978-1-4244-8667-0
Electronic_ISBN :
978-1-4244-8666-3
DOI :
10.1109/ICSTE.2010.5608854