DocumentCode :
3243287
Title :
Emerging Topic Tracking System
Author :
Bun, Khoo Khyou ; Ishizuka, Mitsuru
Author_Institution :
Dept. of Inf. & Commun. Eng., Tokyo Univ., Japan
fYear :
2001
fDate :
2001
Firstpage :
2
Lastpage :
11
Abstract :
We designed a system that track the changes to a particular area of a user´s interests on the World Wide Web and to generate a summary of emerging topics back to the user. This system consists of three main components, which are the Area View System, the Web Spider and the Summary Generator. The Area View System, as a meta-search engine, directs the user´s keywords to a commercial search engine, obtains the hits, performs further analysis and derives a number of most relevant domain sites. Then, the Web Spider dispatches and scans all these domains at a certain time interval to collect all the modified and newly added HTML pages. Lastly, the Summary Generator extracts all the newly added sentences or changes from the collected HTML pages and then counts the term weights in the changes by adapting a newly innovated algorithm called TF*PDF (Term Frequency * Proportional Document Frequency). The terms that deem to explain the emerging topic are heavily weighted. The sentences with the highest average weight are extracted to form a summary of emerging topics. We refer to our system as the Emerging Topic Tracking System (ETTS)
Keywords :
hypermedia markup languages; information resources; relevance feedback; search engines; tracking; user modelling; Area View System; ETTS; Emerging Topic Tracking System; HTML pages; Summary Generator; TF*PDF algorithm; Web Spider; World Wide Web; keywords; meta-search engine; newly added sentences; proportional document frequency; relevant domain sites; term frequency; term weights; user interest change tracking; Design engineering; Difference engines; Frequency; HTML; Information analysis; Internet; Metasearch; Search engines; Uniform resource locators; Web page design;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Issues of E-Commerce and Web-Based Information Systems, WECWIS 2001, Third International Workshop on.
Conference_Location :
San Juan, CA
Print_ISBN :
0-7695-1224-0
Type :
conf
DOI :
10.1109/WECWIS.2001.933900
Filename :
933900
Link To Document :
بازگشت