• DocumentCode
    2019725
  • Title

    An advanced NLP framework for high-quality Text-to-Speech synthesis

  • Author

    Ungurean, Catalin ; Burileanu, Dragos

  • Author_Institution
    Fac. of Electron., Telecommun. & IT, Univ. Politeh. of Bucharest, Bucharest, Romania
  • fYear
    2011
  • fDate
    18-21 May 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    To build a TTS (Text-to-Speech) synthesis system, one must provide two key components: an NLP (Natural Language Processing) stage, which essentially operates on the input text, and a speech generation stage, which produces the desired output. These two distinct levels must exchange both data and commands to produce intelligible and natural speech. Because the complete TTS task draws on many distinct scientific areas, any achievement toward standardization can reduce the effort involved and accelerate the pace of results. This paper gives an overview of the NLP stage in the TTS system for the Romanian language built by our group, and describes the integration into the system of SSML (Speech Synthesis Markup Language), a now widely recognized standard for TTS document authoring and inter-module communication.
  • Keywords
    natural language processing; speech synthesis; Romanian language; speech generation; speech synthesis markup language; text to speech synthesis; Dictionaries; Pragmatics; Speech; Stress; Testing; Training; NLP; SSML; TTS; diacritic restoration; letter-to-phone conversion; lexical stress assignment; syllabification
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Speech Technology and Human-Computer Dialogue (SpeD), 2011 6th Conference on
  • Conference_Location
    Brasov
  • Print_ISBN
    978-1-4577-0440-6
  • Type

    conf

  • DOI
    10.1109/SPED.2011.5940733
  • Filename
    5940733