Title :
A concatenative speech synthesis for monosyllabic languages with limited data
Author :
Trung-Nghia Phung ; Chi Luong Mai ; Akagi, Masato
Author_Institution :
Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
Abstract :
Quality of unit-based concatenative speech synthesis is low while that of corpus-based concatenative speech synthesis with unit selection is great natural. However, unit selection requires a huge data for concatenation that reduces the range of its applications. In this paper, by using temporal decomposition for modeling contextual effects intra-syllable and inter-syllables, we propose a context-fitting unit modification method and a context-matching unit selection method. The two proposed context-specific methods are used in our proposed syllable-based concatenative speech synthesis applied for monosyllabic languages. The experimental results with a Vietnamese speech synthesis using a small corpus support that the proposed methods are efficient. As a consequence, the naturalness and intelligibility of the proposed speech synthesis is high even when we have only limited data for concatenation.
Keywords :
natural language processing; speech synthesis; Vietnamese speech synthesis; context-fitting unit modification method; context-matching unit selection method; context-specific method; contextual effects intersyllable modeling; contextual effects intrasyllable modeling; corpus-based concatenative speech synthesis; monosyllabic language; syllable-based concatenative speech synthesis; temporal decomposition; unit-based concatenative speech synthesis; Cascading style sheets; Context; Context modeling; Hidden Markov models; Interpolation; Speech; Speech synthesis;
Conference_Titel :
Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
Conference_Location :
Hollywood, CA
Print_ISBN :
978-1-4673-4863-8