• DocumentCode
    940402
  • Title

    Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence

  • Author

    Ananthakrishnan, Sankaranarayanan ; Narayanan, Shrikanth S.

  • Author_Institution
    Signal & Image Process. Inst. (SIPI), Univ. of Southern California, Los Angeles, CA
  • Volume
    16
  • Issue
    1
  • fYear
    2008
  • Firstpage
    216
  • Lastpage
    228
  • Abstract
    With the advent of prosody annotation standards such as tones and break indices (ToBI), speech technologists and linguists alike have been interested in automatically detecting prosodic events in speech. This is because the prosodic tier provides an additional layer of information over the short-term segment-level features and lexical representation of an utterance. As the prosody of an utterance is closely tied to its syntactic and semantic content in addition to its lexical content, knowledge of the prosodic events within and across utterances can assist spoken language applications such as automatic speech recognition and translation. On the other hand, corpora annotated with prosodic events are useful for building natural-sounding speech synthesizers. In this paper, we build an automatic detector and classifier for prosodic events in American English, based on their acoustic, lexical, and syntactic correlates. Following previous work in this area, we focus on accent (prominence, or "stress") and prosodic phrase boundary detection at the syllable level. Our experiments achieved a performance rate of 86.75% agreement on the accent detection task, and 91.61% agreement on the phrase boundary detection task on the Boston University Radio News Corpus.
  • Keywords
    acoustic signal processing; natural languages; signal classification; speech processing; speech recognition; speech synthesis; American English; acoustic evidence; automatic prosodic event classifier; automatic speech prosodic event detection; automatic speech recognition; automatic speech translation; lexical evidence; natural-sounding speech synthesizer; prosodic phrase boundary detection; prosody annotation standard; spoken language processing; syntactic evidence; Accent; prominence; prosodic phrase boundary; prosody recognition; prosody-syntax interface; stress
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.907570
  • Filename
    4358088