مرکز منطقه ای اطلاع رساني علوم و فناوري - Database Pruning for Unsupervised Building of Text-To-Speech Voices

DocumentCode :

454680

Title :

Database Pruning for Unsupervised Building of Text-To-Speech Voices

Author :

Adell, Jordi ; Agüero, Pablo Daniel ; Bonafonte, Antonio

Author_Institution :

Dept. of Signal Theory & Comunications, Univ. Politecnica de Catalunya

Volume :

fYear :

2006

fDate :

14-19 May 2006

Abstract :

Unit selection speech synthesis techniques lead the speech synthesis state of the art. Automatic segmentation of databases is necessary in order to build new voices. They may contain errors and segmentation processes may introduce some more. Quality systems require a significant effort to find and correct these segmentation errors. Phonetic transcription is crucial and is one of the manually supervised tasks. The possibility to automatically remove incorrectly transcribed units from the inventory will help to make the process more automatic. Here we present a new technique based on speech recognition confidence measures that reaches to remove 90% of incorrectly transcribed units from a database. The cost for it is loosing only a 10% of correctly transcribed units

Keywords :

speech recognition; speech synthesis; automatic segmentation; database pruning; phonetic transcription; speech recognition; text-to-speech voices; unit selection speech synthesis; unsupervised building; Art; Costs; Databases; Error correction; Hidden Markov models; Inventory management; Natural languages; Speech recognition; Speech synthesis; Synthesizers;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

Conference_Location :

Toulouse

ISSN :

1520-6149

Print_ISBN :

1-4244-0469-X

Type :

conf

DOI :

10.1109/ICASSP.2006.1660164

Filename :

1660164

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=454680