DocumentCode
2019725
Title
An advanced NLP framework for high-quality Text-to-Speech synthesis
Author
Ungurean, Catalin ; Burileanu, Dragos
Author_Institution
Fac. of Electron., Telecommun. & IT, Univ. Politeh. of Bucharest, Bucharest, Romania
fYear
2011
fDate
18-21 May 2011
Firstpage
1
Lastpage
6
Abstract
In order to build a TTS (Text-to-Speech) synthesis system one must provide two key components: a NLP (Natural Language Processing) stage, which essentially operates on the input text, and a speech generation stage to produce the desired output. These two distinct levels must exchange both data and commands to produce intelligible and natural speech. As the complete TTS task relies on many distinct scientific areas, any achievement toward standardization can minimize the effort and increase the dynamic of the results. This paper gives an overview of the NLP stage in the TTS system for Romanian language built by our collective, and describes the integration into the system of SSML (Speech Synthesis Markup Language), as a nowadays well recognized standard for TTS document authoring and inter-modules communication.
Keywords
natural language processing; speech synthesis; Romanian language; natural language processing; speech generation; speech synthesis markup language; text to speech synthesis; Dictionaries; Natural language processing; Pragmatics; Speech; Stress; Testing; Training; NLP; SSML; TTS; diacritic restoration; letter-to-phone conversion; lexical stress assignment; syllabification;
fLanguage
English
Publisher
ieee
Conference_Titel
Speech Technology and Human-Computer Dialogue (SpeD), 2011 6th Conference on
Conference_Location
Brasov
Print_ISBN
978-1-4577-0440-6
Type
conf
DOI
10.1109/SPED.2011.5940733
Filename
5940733
Link To Document