DocumentCode :
3211389
Title :
Topic extraction from news archive using TF*PDF algorithm
Author :
Bun, Khoo Khyou ; Ishizuka, Mitsuru
Author_Institution :
Dept. of Inf. & Commun. Eng., Univ. of Tokyo, Japan
fYear :
2002
fDate :
12-14 Dec. 2002
Firstpage :
73
Lastpage :
82
Abstract :
Since the Web became widespread, the amount of electronically available information online, especially news archives, has proliferated and threatens to become overwhelming. We propose an information system that will extract main topics in a news archive on a weekly basis. By obtaining a weekly report, a user can know what the main news events were in the past week.
Keywords :
information resources; information retrieval systems; TF*PDF algorithm; Web; information system; news archive; topic extraction; weekly report; Algorithm design and analysis; Broadcasting; Content based retrieval; Data mining; Event detection; Frequency; Humans; Information retrieval; Information systems; Intelligent systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Information Systems Engineering, 2002. WISE 2002. Proceedings of the Third International Conference on
Print_ISBN :
0-7695-1766-8
Type :
conf
DOI :
10.1109/WISE.2002.1181645
Filename :
1181645
Link To Document :
بازگشت