DocumentCode :
2802704
Title :
On the construction of unit databanks for text-to-speech systems
Author :
Latsch, Vagner L. ; Netto, Sergio L.
Author_Institution :
COPPE/UFRJ, Rio de Janeiro
fYear :
2006
fDate :
3-6 Sept. 2006
Firstpage :
340
Lastpage :
343
Abstract :
This work deals with one stage in the development of a text-to-speech (TTS) system that demands a great amount of time and effort and is strongly related to the resulting speech quality: the determination of the speech-unit databank. To that end, we present a software tool, the so-called Editor, that integrates all major steps of the database determination in a single environment. The whole process includes recording, segmentation, and labeling of speech units to be concatenated in the time domain. The Editor includes a low-cost and precise method for determining the pitch marks, utilizing an auxiliary signal obtained from a contact (throat) microphone. For the phonetic speech labeling, we revise an algorithm for acoustic segmentation, which yields interesting results when proper operating conditions are imposed. The result is a simplified procedure for creating a complete unit database, fully integrated into a single, user-friendly system.
Keywords :
database management systems; software tools; speech processing; speech synthesis; speech-based user interfaces; Editor software tool; acoustic segmentation; phonetic speech labeling; text-to-speech system; unit databank; Concatenated codes; Databases; Labeling; Microphones; Signal processing; Signal processing algorithms; Software tools; Speech processing; Speech synthesis; Time domain analysis; Speech signal processing; text-to-speech
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Telecommunications Symposium, 2006 International
Conference_Location :
Fortaleza, Ceara
Print_ISBN :
978-85-89748-04-9
Electronic_ISBN :
978-85-89748-04-9
Type :
conf
DOI :
10.1109/ITS.2006.4433295
Filename :
4433295