DocumentCode :
3619606
Title :
AlpSynth - concatenation-based speech synthesis for the Slovenian language
Author :
J.Z. Gros;A. Mihelic;N. Pavesic;M. Zganec;S. Gruden
Author_Institution :
Alpineon RTD, Ljubljana
fYear :
2005
fDate :
6/27/1905 12:00:00 AM
Firstpage :
213
Lastpage :
216
Abstract :
The paper focuses on the design and collection of a speech corpus of elemental speech units for AlpSynth, a corpus-driven Slovenian TTS system. We describe the design procedures for a new speech corpus: purpose definition, content selection, definition of recording conditions and requirements, corpus segmentation and annotation. First we describe and comment the results of a frequency analysis of Slovenian allophone strings performed on a large Slovenian input text that has been converted to allophones. Further we present a method we designed for selection of a compact and efficient set of Slovenian sentences out of a large text corpus so as to minimize the final representative speech corpus. The selected sentences cover all the desired most frequent Slovenian quadphones, triphones and subsequently diphones. We describe the recording sessions and recording conditions. We continue describing the corpus annotation process. Finally, we describe the archive structure of the spoken corpus and present the information on its structure, content and size
Keywords :
"Speech synthesis","Natural languages","Speech recognition","Telephony","Frequency conversion","Design methodology","Speech processing","Costs","User interfaces","Resumes"
Publisher :
ieee
Conference_Titel :
ELMAR, 2005. 47th International Symposium
Print_ISBN :
953-7044-01-4
Type :
conf
DOI :
10.1109/ELMAR.2005.193680
Filename :
1505681
Link To Document :
بازگشت