Title :
Database Pruning for Unsupervised Building of Text-To-Speech Voices
Author :
Adell, Jordi ; Agüero, Pablo Daniel ; Bonafonte, Antonio
Author_Institution :
Dept. of Signal Theory & Comunications, Univ. Politecnica de Catalunya
Abstract :
Unit selection speech synthesis techniques lead the speech synthesis state of the art. Automatic segmentation of databases is necessary in order to build new voices. They may contain errors and segmentation processes may introduce some more. Quality systems require a significant effort to find and correct these segmentation errors. Phonetic transcription is crucial and is one of the manually supervised tasks. The possibility to automatically remove incorrectly transcribed units from the inventory will help to make the process more automatic. Here we present a new technique based on speech recognition confidence measures that reaches to remove 90% of incorrectly transcribed units from a database. The cost for it is loosing only a 10% of correctly transcribed units
Keywords :
speech recognition; speech synthesis; automatic segmentation; database pruning; phonetic transcription; speech recognition; text-to-speech voices; unit selection speech synthesis; unsupervised building; Art; Costs; Databases; Error correction; Hidden Markov models; Inventory management; Natural languages; Speech recognition; Speech synthesis; Synthesizers;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660164