• DocumentCode
    2974068
  • Title

    Lattice-based lexical cues for word fragment detection in conversational speech

  • Author

    Audhkhasi, Kartik ; Georgiou, Panayiotis ; Narayanan, Shrikanth

  • Author_Institution
    Electr. Eng. Dept., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    568
  • Lastpage
    573
  • Abstract
    Previous approaches to the problem of word fragment detection in speech have focussed primarily on acoustic-prosodic features. This paper proposes that the output of a continuous automatic speech recognition (ASR) system can also be used to derive robust lexical features for the task. We hypothesize that the confusion in the word lattice generated by the ASR system can be exploited for detecting word fragments. Two sets of lexical features are proposed -one which is based on the word confusion, and the other based on the pronunciation confusion between the word hypotheses in the lattice. Classification experiments with a support vector machine (SVM) classifier show that these lexical features perform better than the previously proposed acoustic-prosodic features by around 5.20% (relative) on a corpus chosen from the DARPA Transtac Iraqi-English (San Diego) corpus. A combination of both these feature sets improves the word fragment detection accuracy by 11.50% relative to using just the acoustic-prosodic features.
  • Keywords
    signal classification; speech recognition; support vector machines; text analysis; ASR system; SVM classifier; automatic speech recognition; conversational speech; lattice-based lexical cues; pronunciation confusion; support vector machine; word confusion; word fragment detection; word hypothesis; Acoustic signal detection; Automatic speech recognition; Lattices; Robustness; Speech analysis; Speech synthesis; Support vector machine classification; Support vector machines; Surface-mount technology; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5373419
  • Filename
    5373419