• DocumentCode
    2897803
  • Title

    Automatic summarization of Japanese sentences and its application to a WWW KWIC index

  • Author

    Kiyota, Youji ; Kurohashi, Sadao

  • Author_Institution
    Graduate Sch. of Inf., Kyoto Univ., Japan
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    120
  • Lastpage
    127
  • Abstract
    This paper presents a system which creates a KWIC index of WWW texts in Japanese by automatic summarization. The system consists of three modules: a WWW spider, an extractor of important sentences, and a sentence summarizer. The most effective module is the last one which employs a robust and fairly accurate Japanese parser: KNP. It segments an input sentence into phrases or simple sentences and assembles a summary. The accuracy of the important sentence extractor was 62.8% and that of the sentence summarizer was 76.5%
  • Keywords
    Internet; abstracting; indexing; natural languages; search engines; Japanese parser; Japanese sentences; KNP; WWW KWIC index; WWW spider; World Wide Web; automatic summarization; directory indexes; extractor; important sentence extractor; important sentences; search engines; sentence summarizer; Assembly; Data mining; HTML; Informatics; Robustness; Search engines; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications and the Internet, 2001. Proceedings. 2001 Symposium on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    0-7695-0942-8
  • Type

    conf

  • DOI
    10.1109/SAINT.2001.905175
  • Filename
    905175