• DocumentCode
    1010094
  • Title

    Speech-to-text and speech-to-speech summarization of spontaneous speech

  • Author

    Furui, Sadaoki ; Kikuchi, Tomonori ; Shinnaka, Yousuke ; Hori, Chiori

  • Author_Institution
    Dept. of Comput. Sci., Tokyo Inst. of Technol., Japan
  • Volume
    12
  • Issue
    4
  • fYear
    2004
  • fDate
    7/1/2004 12:00:00 AM
  • Firstpage
    401
  • Lastpage
    408
  • Abstract
    This paper presents techniques for speech-to-text and speech-to-speech automatic summarization based on speech unit extraction and concatenation. For the former case, a two-stage summarization method consisting of important sentence extraction and word-based sentence compaction is investigated. Sentence and word units which maximize the weighted sum of linguistic likelihood, amount of information, confidence measure, and grammatical likelihood of concatenated units are extracted from the speech recognition results and concatenated for producing summaries. For the latter case, sentences, words, and between-filler units are investigated as units to be extracted from original speech. These methods are applied to the summarization of unrestricted-domain spontaneous presentations and evaluated by objective and subjective measures. It was confirmed that proposed methods are effective in spontaneous speech summarization.
  • Keywords
    speech processing; speech recognition; text analysis; between-filler units; concatenated units grammatical likelihood; confidence measure; information amount; linguistic likelihood weighted sum; speech recognition; speech unit concatenation; speech unit extraction; speech-to-speech summarization; speech-to-text summarization; spontaneous speech; unrestricted-domain spontaneous presentation; word-based sentence compaction; Broadcasting; Compaction; Concatenated codes; Data mining; Laboratories; Natural languages; Speech recognition; Speech synthesis; Synthesizers; Text recognition;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2004.828699
  • Filename
    1306513