• DocumentCode
    3231020
  • Title

    The use of emphasis to automatically summarize a spoken discourse

  • Author

    Chen, Francine R. ; Withgott, Margaret

  • Author_Institution
    Xerox Palo Alto Res. Center, CA, USA
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    229
  • Abstract
    The authors describe a method for exploiting prosodic information in natural, conventional speed for the purpose of automatically creating an audio summary. The method is based on identifying emphasized speech and then using proximity measures on the emphasized regions to select summarizing excerpts. Emphasized speech is recognized using a hidden Markov model using only non-spectral, periodic information. Syllable-based models were created and the models trained on spontaneous speech in which words had been labeled by a panel of listeners for degree of emphasis. Emphatic speech from one speaker was automatically detected and summarizing excerpts were identified, with no noticeable difference when compared to excerpts selected by individual subjects. The extensibility of the emphasis detector to other speakers was tested on a small sample of telephone speech by ten other speakers
  • Keywords
    hidden Markov models; speech recognition; audio summary; emphatic speech; hidden Markov model; prosodic information; summarizing excerpts; syllable-based models; telephone speech; Automatic speech recognition; Detectors; Event detection; Hidden Markov models; Pattern recognition; Speech recognition; Statistical analysis; Stress; Telephony; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225930
  • Filename
    225930