Title :
TSM: Topic Selection Method of Web Documents
Author :
Hwang, Myonggwon ; Kong, Hyunjang ; Baek, Sunkyoung ; Kim, Pankoo
Author_Institution :
Dept. of Comput. Sci., Chosun Univ., Gwangju
Abstract :
In this paper, we propose a topic selection method for Web documents. Our approach uses an ontology structure together with the TF (term frequency) value of each term. Such a method is needed to improve the performance of document clustering. We process Web documents to acquire keywords from their TF values and to compute relevancy values between terms from the relations defined in WordNet. We then propose a topic selection formula that accounts for three kinds of cases that arise during topic selection. In conclusion, we demonstrate that our approach is useful for selecting the topics of documents.
Keywords :
Internet; document handling; ontologies (artificial intelligence); Web documents; WordNet; documents clustering; ontology structure; term frequency values; topic selection method; Asia; Computer aided software engineering; Computer science; Frequency measurement; Natural languages; Ontologies; Particle measurements; Societies
Conference_Title :
Modelling & Simulation, 2007. AMS '07. First Asia International Conference on
Conference_Location :
Phuket
Print_ISBN :
0-7695-2845-7
DOI :
10.1109/AMS.2007.108