DocumentCode
3348557
Title
ICA-based hierarchical text classification for multi-domain text-to-speech synthesis
Author
Sevillano, X. ; Alias, F. ; Socoro, J.C.
Author_Institution
Department of Communications and Signal Theory. Enginyeria i Arquitectura La Salle. Universitat Ramon Llull. Barcelona, Spain
Volume
5
fYear
2004
fDate
17-21 May 2004
Lastpage
697
Abstract
In the framework of multi-domain text-to-speech synthesis, it is essential (i) to design a hierarchically structured database for allowing several domains in the same speech corpus and (ii) to include a text classification module that, at run time, assigns the input sentences to a domain or set of domains from the database. We present a hierarchical text classifier based on independent component analysis (ICA), which is capable of (i) organizing the contents of the corpus in a hierarchical manner and (ii) classifying the texts to be synthesized according to the learned structure. The document organization and classification performance of our ICA-based hierarchical classifier are evaluated in several encouraging experiments conducted on a journalistic-style text corpus for speech synthesis in Catalan.
Keywords
database management systems; independent component analysis; natural languages; pattern classification; speech synthesis; text analysis; Catalan; ICA; hierarchical text classification; hierarchically structured database; independent component analysis; multi-domain text-to-speech synthesis; speech corpus; Buildings; Independent component analysis; Integrated circuit testing; Organizing; Signal design; Signal synthesis; Spatial databases; Speech enhancement; Speech synthesis; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Conference_Location
Montreal, Que.
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1327206
Filename
1327206
Link To Document