DocumentCode :
2897803
Title :
Automatic summarization of Japanese sentences and its application to a WWW KWIC index
Author :
Kiyota, Youji ; Kurohashi, Sadao
Author_Institution :
Graduate Sch. of Inf., Kyoto Univ., Japan
fYear :
2001
fDate :
2001
Firstpage :
120
Lastpage :
127
Abstract :
This paper presents a system which creates a KWIC index of WWW texts in Japanese by automatic summarization. The system consists of three modules: a WWW spider, an extractor of important sentences, and a sentence summarizer. The most effective module is the last one which employs a robust and fairly accurate Japanese parser: KNP. It segments an input sentence into phrases or simple sentences and assembles a summary. The accuracy of the important sentence extractor was 62.8% and that of the sentence summarizer was 76.5%
Keywords :
Internet; abstracting; indexing; natural languages; search engines; Japanese parser; Japanese sentences; KNP; WWW KWIC index; WWW spider; World Wide Web; automatic summarization; directory indexes; extractor; important sentence extractor; important sentences; search engines; sentence summarizer; Assembly; Data mining; HTML; Informatics; Robustness; Search engines; World Wide Web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications and the Internet, 2001. Proceedings. 2001 Symposium on
Conference_Location :
San Diego, CA
Print_ISBN :
0-7695-0942-8
Type :
conf
DOI :
10.1109/SAINT.2001.905175
Filename :
905175
Link To Document :
بازگشت