Title :
Integrating thumbnail features for speech recognition using conditional exponential models
Author :
Yu, Hua ; Waibel, Alex
Author_Institution :
Interactive Syst. Labs, Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
We describe a novel approach for modeling segmental information in speech recognition, through the use of thumbnail features. By taking into account dependencies at the segmental level, thumbnail features are more resistant to changes in speaking rates and other factors. While the traditional acoustic features are fixed for every utterance, one set of thumbnail features is computed for each hypothesis, which may violate the traditional scoring paradigm. To this end, we introduce a conditional exponential modeling framework. It allows better integration of various knowledge sources in a discriminative fashion. We present preliminary experiments on the Switchboard task.
Keywords :
feature extraction; hidden Markov models; learning (artificial intelligence); speech recognition; HMM; Switchboard task; conditional exponential models; segmental information; speech recognition; thumbnail features; Handwriting recognition; Hidden Markov models; Image segmentation; Interactive systems; Machine learning; Pattern matching; Proposals; Robustness; Speech analysis; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326130