DocumentCode
353504
Title
Discriminative resolution enhancement in acoustic modelling
Author
Duchateau, Jacques ; Demuynck, Kris ; Wambacq, Patrick
Author_Institution
ESAT, Katholieke Univ., Leuven, Heverlee, Belgium
Volume
3
fYear
2000
fDate
2000
Firstpage
1245
Abstract
The accuracy of the acoustic models in large vocabulary recognition systems can be improved by increasing the resolution in the acoustic feature space. This can be obtained by increasing the number of Gaussian densities in the models by splitting of the Gaussians. This paper proposes a novel algorithm for this splitting operation. It is based on the phonetic decision tree used for the state tying in context dependent modelling. The advantage of the method is that it improves the capability of the acoustic models to discriminate between the different tied states. The proposed splitting algorithm was evaluated on the Wall Street Journal recognition task. Comparison with a commonly used splitting algorithm clearly shows that our method can provide smaller (thus faster) acoustic models and results in lower error rates
Keywords
Gaussian processes; decision trees; speech enhancement; speech recognition; Gaussian densities; acoustic feature space; acoustic modelling; context dependent modelling; discriminative resolution enhancement; error rates; large vocabulary recognition systems; phonetic decision tree; state tying; Context modeling; Decision trees; Error analysis; Mutual information; Speech recognition; Tail; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.861801
Filename
861801
Link To Document