• DocumentCode
    679814
  • Title

    Implementation of malayalam text to speech using concatenative based TTS for android platform

  • Author

    Gopi, A. ; Shobana, P. Devi ; Sajini, T. ; Bhadran, V.K.

  • Author_Institution
    Language Technol. Sect., Centre for Dev. of Adv. Comput., Thiruvananthapuram, India
  • fYear
    2013
  • fDate
    13-15 Dec. 2013
  • Firstpage
    184
  • Lastpage
    189
  • Abstract
    The recent development in text to speech has been switched to concatenative synthesis, either using original speech segments or parametric synthesis. The former TTS system gives a better quality output since they use the original speech segment for concatenation. There are a number of different other techniques for speech generation like PSOLA, TDPSOLA, EMBROLA etc. This paper describes the development and implementation of concatenative based system based on Epoch Synchronous Non Overlap and Add (ESNOLA) technique for Malayalam in android platform. The TTS uses diphone like segments (partneme) as the basic units for concatenation. The database contain 1500 partnemes, which are used for generating speech for unlimited domain text. The paper also briefs about the implementation of Malayalam TTS, the database generation, the modification done for android platform, the database access and handling Malayalam character display in android platform, the support provided in the TTS app for displaying characters with proper rendering. The app support android latest versions upto android 4.2. The design for Newsreader, an application using android is also discussed in this paper. The TTS gives a Mean Opinion Score (MOS) of 3.2 in the perceptual test..
  • Keywords
    Android (operating system); concatenated codes; natural language processing; speech synthesis; Android platform; EMBROLA; ESNOLA technique; Epoch Synchronous NonOverlap and Add technique; MOS; Malayalam character display handling; Malayalam text to speech; Newsreader design; TDPSOLA; concatenation; concatenative based TTS; concatenative synthesis; database access; mean opinion score; parametric synthesis; partneme; perceptual test; rendering; speech generation; speech segments; Androids; Databases; Humanoid robots; Mobile communication; Smart phones; Speech; Synthesizers; concatenative synthesis; esnola; partneme; psola; text normalisation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Communication and Computing (ICCC), 2013 International Conference on
  • Conference_Location
    Thiruvananthapuram
  • Print_ISBN
    978-1-4799-0573-7
  • Type

    conf

  • DOI
    10.1109/ICCC.2013.6731647
  • Filename
    6731647