Title :
Maximum mutual information based acoustic-features representation of phonological features for speech recognition
Author :
Omar, M. Kamal ; Hasegawa-Johnson, Mark
Author_Institution :
University of Illinois at Urbana-Champaign, Department of Electrical and Computer Engineering, 61801, USA
Abstract :
This paper addresses the problem of finding a subset of the acoustic feature space that best represents a set of phonological features. A maximum mutual information approach is presented for selecting the acoustic features that are combined to represent the distinctions coded by a set of correlated phonological features. Each set of phonological features is chosen on the basis of acoustic-phonetic similarity, so the sets can be treated as approximately independent. Consequently, the outputs of recognizers that recognize these sets independently, each using the acoustic representation produced by the algorithm presented in this paper, can be combined to increase the efficiency and robustness of speech recognition systems. The mutual information between the phonological feature sets and their acoustic representations increases by up to 220% over the best single-type acoustic representation of the same length.
Keywords :
Cepstrum; Mel frequency cepstral coefficient; Mutual information
Conference_Title :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.2002.5743659