• DocumentCode
    2166040
  • Title

    Performance of Marathi language TTS synthesis based on perceptual test and spectrogram analysis

  • Author

    Bormane, Dattatraya S. ; Shirbahadurkar, S.D. ; Shiurka, U.D.

  • Author_Institution
    JSPM's, Rajarshi Shahu Coll. of Eng., Pune, India
  • Volume
    3
  • fYear
    2010
  • fDate
    26-28 Feb. 2010
  • Firstpage
    40
  • Lastpage
    43
  • Abstract
    This paper describes work on a concatenative text-to-speech synthesis system. It discusses perceptual and spectrogram experiments conducted on Marathi voices (Marathi is spoken in Maharashtra, India). A Marathi speech synthesizer was developed using databases built from different choices of unit: words and phonemes. We synthesized Marathi text and conducted perceptual tests, with the following results: (1) 74% of the speech synthesized by the proposed method was preferred to that of the conventional method; (2) the mean opinion score (MOS) was 3.94 in a five-point MOS test, and 87% of the synthesized speech had the same naturalness as natural speech, based on 40 samples taken from various slots of the databases; (3) histograms for the various speech databases show the effectiveness of the proposed method; (4) spectrogram analysis was carried out on various words concatenated using phonemes and syllables as units.
  • Keywords
    natural language processing; speech processing; speech synthesis; Marathi Voices; Marathi language TTS synthesis; concatenative text-to-speech synthesis system; database; five-point MOS test; histogram; mean opinion score; perceptual test; phonemes; spectrogram analysis; speech synthesizer; words; Concatenated codes; Databases; Histograms; Natural languages; Performance analysis; Spectrogram; Speech analysis; Speech synthesis; Synthesizers; Testing; Speech synthesis; concatenation; histogram; unit size
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Automation Engineering (ICCAE), 2010 The 2nd International Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-5585-0
  • Electronic_ISBN
    978-1-4244-5586-7
  • Type

    conf

  • DOI
    10.1109/ICCAE.2010.5451850
  • Filename
    5451850