• DocumentCode
    454694
  • Title

    Chinese Spoken Document Summarization Using Probabilistic Latent Topical Information

  • Author

    Chen, Berlin ; Yeh, Yao-Ming ; Huang, Yao-Min ; Yi-Ting Chen

  • Author_Institution
    Graduate Inst. of Comput. Sci. & Inf. Eng., Nat. Taiwan Normal Univ., Taipei
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    The purpose of extractive summarization is to automatically select a number of indicative sentences, passages, or paragraphs from the original document according to a target summarization ratio and then sequence them to form a concise summary. In the paper, we proposed the use of probabilistic latent topical information for extractive summarization of spoken documents. Various kinds of modeling structures and learning approaches were extensively investigated. In addition, the summarization capabilities were verified by comparison with the conventional vector space model and latent semantic indexing model, as well as the HMM model. The experiments were performed on the Chinese broadcast news collected in Taiwan. Noticeable performance gains were obtained
  • Keywords
    document handling; natural languages; probability; Chinese broadcast news; Chinese spoken document summarization; probabilistic latent topical information; Computer science; Data mining; Digital multimedia broadcasting; Hidden Markov models; Indexing; Man machine systems; Multimedia communication; Performance gain; Speech; Wireless communication;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660184
  • Filename
    1660184