DocumentCode :
3348557
Title :
ICA-based hierarchical text classification for multi-domain text-to-speech synthesis
Author :
Sevillano, X. ; Alias, F. ; Socoro, J.C.
Author_Institution :
Department of Communications and Signal Theory. Enginyeria i Arquitectura La Salle. Universitat Ramon Llull. Barcelona, Spain
Volume :
5
fYear :
2004
fDate :
17-21 May 2004
Lastpage :
697
Abstract :
In the framework of multi-domain text-to-speech synthesis, it is essential (i) to design a hierarchically structured database for allowing several domains in the same speech corpus and (ii) to include a text classification module that, at run time, assigns the input sentences to a domain or set of domains from the database. We present a hierarchical text classifier based on independent component analysis (ICA), which is capable of (i) organizing the contents of the corpus in a hierarchical manner and (ii) classifying the texts to be synthesized according to the learned structure. The document organization and classification performance of our ICA-based hierarchical classifier are evaluated in several encouraging experiments conducted on a journalistic-style text corpus for speech synthesis in Catalan.
Keywords :
database management systems; independent component analysis; natural languages; pattern classification; speech synthesis; text analysis; Catalan; ICA; hierarchical text classification; hierarchically structured database; independent component analysis; multi-domain text-to-speech synthesis; speech corpus; Buildings; Independent component analysis; Integrated circuit testing; Organizing; Signal design; Signal synthesis; Spatial databases; Speech enhancement; Speech synthesis; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Conference_Location :
Montreal, Que.
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1327206
Filename :
1327206
Link To Document :
بازگشت