DocumentCode
180182
Title
Out-of-vocabulary word detection in a speech-to-speech translation system
Author
Hong-Kwang Kuo ; Kislal, Ellen Eide ; Mangu, Lidia ; Soltau, Hagen ; Beran, Tomas
Author_Institution
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
2014
fDate
4-9 May 2014
Firstpage
7108
Lastpage
7112
Abstract
In this paper we describe progress we have made in detecting out-of-vocabulary words (OOVs) for a speech-to-speech translation system for the purpose of playing back audio to the user for clarification and correction. Our OOV detector follows a strategy of first identifying a rough location of the OOV and then merging adjacent decoded words to cover the true OOV word. We show the advantage of our OOV detection strategy and report on improvements using a real-time implementation of a new Convolutional Neural Network acoustic model. We discuss why commonly used metrics for OOV detection do not meet our needs and explore an overlap metric as well as a Jaccard metric for evaluating our ability to detect the OOVs and localize them accurately in time. We have found different metrics to be useful at different stages of development.
Keywords
natural language processing; neural nets; speech processing; Jaccard metric; OOV detection; OOV localization; convolutional neural network acoustic model; out-of-vocabulary word detection; overlap metric; speech-to-speech translation system; Acoustics; Detectors; Measurement; Merging; Speech; Speech recognition; Vocabulary; OOV; metric; speech-to-speech translation system;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854979
Filename
6854979
Link To Document