Title :
A novel quasi-diphone inventory approach to Text-To-Speech synthesis
Author :
Gerazov, Branislav ; Shutinoski, Goce ; Arsov, Goce
Author_Institution :
Dept. of Electron., Univ. of Ss. Cyril & Methodius, Skopje
Abstract :
The paper presents a novel approach to concatenative text-to-speech synthesis. The system uses a unique optimized mixed-rank inventory, based on a modification of the classical diphone concept. A new unit type is introduced in our work, dubbed the quasi-diphone unit. A set of these units is designed to cover all the critical transitions between phones and at the same time to be compatible with phone-length units for concatenation purposes. This allows for inventory optimization in respect to its size and quality of the generated speech. The system includes elementary pitch, duration and amplitude modeling implemented with the standard PSOLA algorithm. Presented results show that it was possible to achieve full intelligibility and reasonable naturalness whilst maintaining a rather small inventory. The system was specially developed for the synthesis of Macedonian, and is the first HQ TTS system for this language. Using the developed standardized interface between the modules, the system is also applicable to any of the worldpsilas languages.
Keywords :
natural language processing; speech synthesis; Macedonian; amplitude modeling; elementary pitch; quasi-diphone inventory approach; text-to-speech synthesis; time domain pitch-synchronous overlap add algorithm; Analog computers; Helium; History; Human voice; Information technology; Natural languages; Spectrogram; Speech processing; Speech synthesis; Synthesizers; Macedonian; TTS; concatenative synthesis; mixed-rank inventory; quasi-diphone;
Conference_Titel :
Electrotechnical Conference, 2008. MELECON 2008. The 14th IEEE Mediterranean
Conference_Location :
Ajaccio
Print_ISBN :
978-1-4244-1632-5
Electronic_ISBN :
978-1-4244-1633-2
DOI :
10.1109/MELCON.2008.4618533