• DocumentCode
    2963722
  • Title

    Detection of OOV words by combining acoustic confidence measures with linguistic features

  • Author

    Stouten, Frederik ; Fohr, Dominique ; Illina, Irina

  • Author_Institution
    Speech Group, LORIA-INRIA, Vandoeuvre-les-Nancy, France
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    371
  • Lastpage
    375
  • Abstract
    This paper describes the design of an out-of-vocabulary words (OOV) detector. Such a system is assumed to detect segments that correspond to OOV words (words that are not included in the lexicon) in the output of a LVCSR system. The OOV detector uses acoustic confidence measures that are derived from several systems: a word recognizer constrained by a lexicon, a phone recognizer constrained by a grammar and a phone recognizer without constraints. On top of that it also uses some linguistic features. The experimental results on a French broadcast news transcription task showed that for our approach precision equals recall at 35%.
  • Keywords
    grammars; natural language processing; signal detection; speech recognition; vocabulary; French broadcast news transcription task; OOV word detection; acoustic confidence measures; grammar; large vocabulary speech recognition; linguistic features; out-of-vocabulary word detector; phone recognizer; word recognizer; Acoustic measurements; Acoustic signal detection; Broadcasting; Detectors; Feature extraction; Neural networks; Speech recognition; Vocabulary; LVCSR; OOV; confidence measures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5372877
  • Filename
    5372877