DocumentCode :
2262632
Title :
Context modeling and clustering in continuous speech recognition
Author :
Junqua, Jean-Claude ; Vassallo, Lorenzo
Author_Institution :
Speech Technol. Lab., Panasonic Technol. Inc., Santa Barbara, CA, USA
Volume :
4
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
2262
Abstract :
Reports on the performance of two variants of well-known statistical-based clustering techniques and presents an evaluation on the TIMIT and TI-Digit databases. A clustering approach which (1) is based on a divergence criterion, (2) separates “good” and “bad” models using a class-dependent adjustable threshold on the number of examples per model, and (3) guides the clustering by limiting the number of models per class between two constants Nmin and Nmax, gave the best results. On the TI-Digit database, the combination of triphone modeling and divergence-based clustering yielded greater accuracy than that obtained with word models for a similar system complexity
Keywords :
modelling; software performance evaluation; speech recognition; statistical analysis; TI-Digit database; TIMIT database; accuracy; class-dependent adjustable threshold; context modeling; continuous speech recognition; divergence criterion; model examples; model separation; performance evaluation; statistical-based clustering techniques; system complexity; triphone modeling; word models; Automatic speech recognition; Context modeling; Databases; Decision trees; Hidden Markov models; Laboratories; Performance evaluation; Speech recognition; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607257
Filename :
607257
Link To Document :
بازگشت