DocumentCode
2166040
Title
Performance of Marathi language TTS synthesis based on perceptual test and spectrogram analysis
Author
Bormane, Dattatraya S. ; Shirbahadurkar, S.D. ; Shiurka, U.D.
Author_Institution
JSPM's, Rajarshi Shahu Coll. of Eng., Pune, India
Volume
3
fYear
2010
fDate
26-28 Feb. 2010
Firstpage
40
Lastpage
43
Abstract
This paper describes work on a concatenative text-to-speech synthesis system. It discusses a few perceptual and spectrogram experiments conducted on Marathi voices (Marathi is spoken in Maharashtra, India). A Marathi speech synthesizer was developed using different choices of unit, words and phonemes, as the database. We synthesized Marathi text and conducted perceptual tests, with the following results: (1) 74% of the speech synthesized by the proposed method was preferred to that of the conventional method; (2) the mean opinion score (MOS) was 3.94 in a five-point MOS test, and 87% of the synthesized speech had the same naturalness as natural speech, with respect to 40 samples taken from various slots of the databases; (3) histograms for various speech databases show the effectiveness of the proposed method; and (4) spectrogram analysis was carried out on various words concatenated with phonemes and syllables as units.
Keywords
natural language processing; speech processing; speech synthesis; Marathi Voices; Marathi language TTS synthesis; concatenative text-to-speech synthesis system; database; five-point MOS test; histogram; mean opinion score; perceptual test; phonemes; spectrogram analysis; speech synthesizer; words; Concatenated codes; Databases; Histograms; Natural languages; Performance analysis; Spectrogram; Speech analysis; Speech synthesis; Synthesizers; Testing; Speech synthesis; concatenation; histogram; unit size;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Automation Engineering (ICCAE), 2010 The 2nd International Conference on
Conference_Location
Singapore
Print_ISBN
978-1-4244-5585-0
Electronic_ISBN
978-1-4244-5586-7
Type
conf
DOI
10.1109/ICCAE.2010.5451850
Filename
5451850