Title :
HMM-based Vietnamese speech synthesis
Author_Institution :
Fac. of Comput. Sci., Univ. of Inf. Technol., Ho Chi Minh City, Vietnam
fDate :
June 28 2015-July 1 2015
Abstract :
In this paper, improving naturalness HMM-based speech synthesis for Vietnamese language is described. By this synthesis method, trajectories of speech parameters are generated from the trained Hidden Markov models. A final speech waveform is synthesized from those speech parameters. The main objective for the development is to achieve maximum naturalness in output speech through key points. Firstly, system uses a high quality recorded Vietnamese speech database appropriate for training, especially in statistical parametric model approach. Secondly, prosodic information such as tone, POS (part of speech) and features based on characteristics of Vietnamese language are added to ensure the quality of synthetic speech. Third, system uses STRAIGHT which showed its ability to produce high-quality voice manipulation and was successfully incorporated into HMM-based speech synthesis. The results collected show that the speech produced by our system has the best result when being compared with the other Vietnamese TTS systems trained from the same speech data.
Keywords :
hidden Markov models; natural language processing; speech synthesis; statistical analysis; HMM-based Vietnamese speech synthesis; POS; STRAIGHT; final speech waveform; hidden Markov models; high quality recorded Vietnamese speech database; high-quality voice manipulation; part of speech; prosodic information; statistical parametric model approach; synthetic speech quality; Context; Context modeling; Databases; Hidden Markov models; Speech; Speech synthesis; Training; HMM-based; STRAIGHT; TTS; Tonal language; Vietnamese speech synthesis; improving naturalness;
Conference_Titel :
Computer and Information Science (ICIS), 2015 IEEE/ACIS 14th International Conference on
Conference_Location :
Las Vegas, NV
DOI :
10.1109/ICIS.2015.7166618