DocumentCode
2897803
Title
Automatic summarization of Japanese sentences and its application to a WWW KWIC index
Author
Kiyota, Youji ; Kurohashi, Sadao
Author_Institution
Graduate Sch. of Inf., Kyoto Univ., Japan
fYear
2001
fDate
2001
Firstpage
120
Lastpage
127
Abstract
This paper presents a system which creates a KWIC index of WWW texts in Japanese by automatic summarization. The system consists of three modules: a WWW spider, an extractor of important sentences, and a sentence summarizer. The most effective module is the last one which employs a robust and fairly accurate Japanese parser: KNP. It segments an input sentence into phrases or simple sentences and assembles a summary. The accuracy of the important sentence extractor was 62.8% and that of the sentence summarizer was 76.5%
Keywords
Internet; abstracting; indexing; natural languages; search engines; Japanese parser; Japanese sentences; KNP; WWW KWIC index; WWW spider; World Wide Web; automatic summarization; directory indexes; extractor; important sentence extractor; important sentences; search engines; sentence summarizer; Assembly; Data mining; HTML; Informatics; Robustness; Search engines; World Wide Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications and the Internet, 2001. Proceedings. 2001 Symposium on
Conference_Location
San Diego, CA
Print_ISBN
0-7695-0942-8
Type
conf
DOI
10.1109/SAINT.2001.905175
Filename
905175
Link To Document