• DocumentCode
    180182
  • Title

    Out-of-vocabulary word detection in a speech-to-speech translation system

  • Author

    Hong-Kwang Kuo ; Kislal, Ellen Eide ; Mangu, Lidia ; Soltau, Hagen ; Beran, Tomas

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    7108
  • Lastpage
    7112
  • Abstract
    In this paper we describe progress we have made in detecting out-of-vocabulary words (OOVs) for a speech-to-speech translation system for the purpose of playing back audio to the user for clarification and correction. Our OOV detector follows a strategy of first identifying a rough location of the OOV and then merging adjacent decoded words to cover the true OOV word. We show the advantage of our OOV detection strategy and report on improvements using a real-time implementation of a new Convolutional Neural Network acoustic model. We discuss why commonly used metrics for OOV detection do not meet our needs and explore an overlap metric as well as a Jaccard metric for evaluating our ability to detect the OOVs and localize them accurately in time. We have found different metrics to be useful at different stages of development.
  • Keywords
    natural language processing; neural nets; speech processing; Jaccard metric; OOV detection; OOV localization; convolutional neural network acoustic model; out-of-vocabulary word detection; overlap metric; speech-to-speech translation system; Acoustics; Detectors; Measurement; Merging; Speech; Speech recognition; Vocabulary; OOV; metric; speech-to-speech translation system;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854979
  • Filename
    6854979