Title :
Parameter tying for flexible speech recognition
Author :
Simonin, J. ; Bodin, S. ; Jouvet, D. ; Bartkova, K.
Author_Institution :
CNET, Lannion, France
Abstract :
The paper presents two parameter tying techniques that enable a trade-off between computational cost and recognition performance in a speaker-independent flexible speech recognition system working over the telephone network. Parameter tying is conducted at the phonetic and acoustic levels. At the phonetic level, allophone-based and triphone-based phonetic modeling are used simultaneously to achieve the best trade-off between computational cost and recognition performance. Compared with allophone modeling alone, this decreases the error rate while keeping the computational cost under control. At the acoustic level, tying is performed by clustering the Gaussian densities of the mixture distributions. After clustering, a particular density may be used by several distributions. This allows the total number of Gaussian densities to be divided by two while improving recognition performance.
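The acoustic-level tying described in the abstract can be illustrated with a minimal sketch. The abstract does not state which distance measure or merging rule the authors use, so everything below is an assumption made for illustration: diagonal-covariance Gaussians, a variance-normalised distance between means, greedy bottom-up merging, and a moment-matched merge. The names `Gaussian`, `distance`, `merge`, and `tie_densities` are hypothetical. Mixture weights are left out; only the sharing of densities between distributions is shown.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Gaussian:
    mean: tuple  # per-dimension means
    var: tuple   # per-dimension variances (diagonal covariance, an assumption)


def distance(a, b):
    # Assumed measure: squared mean difference scaled by the geometric
    # mean of the two variances, summed over dimensions.
    return sum((ma - mb) ** 2 / (va * vb) ** 0.5
               for ma, mb, va, vb in zip(a.mean, b.mean, a.var, b.var))


def merge(a, b, na, nb):
    # Moment-matched merge of two densities that stand for na and nb
    # original densities respectively (assumed merging rule).
    n = na + nb
    mean = tuple((na * ma + nb * mb) / n for ma, mb in zip(a.mean, b.mean))
    var = tuple((na * (va + ma ** 2) + nb * (vb + mb ** 2)) / n - m ** 2
                for ma, mb, va, vb, m in zip(a.mean, b.mean, a.var, b.var, mean))
    return Gaussian(mean, var)


def tie_densities(mixtures, target):
    """Greedily merge the closest pair of densities until `target` remain.

    mixtures: dict mapping a distribution name to its list of Gaussians.
    Returns (codebook, tied): the shared densities, and for each
    distribution the codebook index used by each of its components.
    """
    # Each cluster holds: density, number of originals merged, member list.
    clusters = [[g, 1, [(name, i)]]
                for name, comps in mixtures.items()
                for i, g in enumerate(comps)]
    while len(clusters) > target:
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda p: distance(clusters[p[0]][0], clusters[p[1]][0]))
        gi, ni, mi = clusters[i]
        gj, nj, mj = clusters[j]
        clusters[i] = [merge(gi, gj, ni, nj), ni + nj, mi + mj]
        del clusters[j]  # safe: combinations guarantees i < j
    codebook = [c[0] for c in clusters]
    tied = {name: [None] * len(comps) for name, comps in mixtures.items()}
    for k, (_, _, members) in enumerate(clusters):
        for name, i in members:
            tied[name][i] = k
    return codebook, tied


# Toy data (made up): two 1-D mixture distributions with two densities each.
mixtures = {
    "a": [Gaussian((0.0,), (1.0,)), Gaussian((5.0,), (1.0,))],
    "b": [Gaussian((0.1,), (1.0,)), Gaussian((9.0,), (1.0,))],
}
codebook, tied = tie_densities(mixtures, target=3)
print(tied)
```

In this toy case the two nearly identical densities at 0.0 and 0.1 are merged into one codebook entry that both distributions then reference, which is the sense in which a density "may be used by several distributions". Halving the density count, as reported in the abstract, would correspond to setting `target` to half the original total.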
Keywords :
speech recognition; telephone networks; Gaussian mixture distribution density clustering; acoustic level parameter tying; allophone based phonetic modeling; computational cost; error rate; flexible speech recognition; parameter tying techniques; phonetic level parameter tying; recognition performances; speaker independent flexible speech recognition system; telephone network; triphone based phonetic modeling; Acoustic emission; Computational efficiency; Context modeling; Error analysis; Hidden Markov models; Loudspeakers; Parameter estimation; Speech recognition; Telephony; Training data
Conference_Title :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607795