• DocumentCode
    2964655
  • Title

    Local and global models for spontaneous speech segment detection and characterization

  • Author

    Dufour, Richard ; Estève, Yannick ; Deléglise, Paul ; Béchet, Frédéric

  • Author_Institution
    LIUM, Univ. of Le Mans, Le Mans, France
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    558
  • Lastpage
    561
  • Abstract
    Processing spontaneous speech is one of the many challenges that automatic speech recognition (ASR) systems have to deal with. The main evidences characterizing spontaneous speech are disfluencies (filled pause, repetition, repair and false start) and many studies have focused on the detection and the correction of these disfluencies. In this study we define spontaneous speech as unprepared speech, in opposition to prepared speech where utterances contain well-formed sentences close to those that can be found in written documents. Disfluencies are of course very good indicators of unprepared speech, however they are not the only ones: ungrammaticality and language register are also important as well as prosodic patterns. This paper proposes a set of acoustic and linguistic features that can be used for characterizing and detecting spontaneous speech segments from large audio databases. More, we introduce a strategy that takes advantage of a global classification procfalseess using a probabilistic model which significantly improves the spontaneous speech detection.
  • Keywords
    speech recognition; audio databases; automatic speech recognition; false start; filled pause; spontaneous speech segment detection; unprepared speech; Acoustic signal detection; Audio databases; Automatic speech recognition; Broadcasting; Data mining; Natural languages; Protocols; Speech analysis; Speech enhancement; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5372928
  • Filename
    5372928