DocumentCode :
1467879
Title :
Data-driven approach to designing compound words for continuous speech recognition
Author :
Saon, George ; Padmanabhan, Mukund
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
9
Issue :
4
fYear :
2001
fDate :
5/1/2001 12:00:00 AM
Firstpage :
327
Lastpage :
332
Abstract :
We present a new approach to deriving compound words from a training corpus. The motivation for making compound words is because under some assumptions, speech recognition errors occur less frequently in longer words. Furthermore, they also enable more accurate modeling of pronunciation variability at the boundary between adjacent words in a continuously spoken utterance. We introduce a measure based on the product between the direct and the reverse bigram probability of a pair of words for finding candidate pairs in order to create compound words. Our experimental results show that by augmenting both the acoustic vocabulary and the language model with these new tokens, the word recognition accuracy can be improved by absolute 2.8% (7% relative) on a voice mail continuous speech recognition task. We also compare the proposed measure for selecting compound words with other measures that have been described in the literature
Keywords :
natural languages; probability; speech recognition; voice mail; acoustic vocabulary; adjacent words boundary; compound words design; continuously spoken utterance; data-driven approach; direct bigram probability; language model; pronunciation variability; reverse bigram probability; speech recognition errors; training corpus; voice mail continuous speech recognition; word recognition accuracy; Acoustic measurements; Decoding; Error analysis; Natural languages; Speech recognition; Telephony; Training data; Vocabulary; Voice mail;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.917678
Filename :
917678
Link To Document :
بازگشت