Title :
Classification of RSS feed news items using ontology
Author :
Agarwal, Sankalp ; Singhal, Achintya ; Bedi, Punam
Author_Institution :
Dept. of Comput. Sci., Univ. of Delhi, Delhi, India
Abstract :
Explosive growth of data on the web demand techniques, which would enable the user to access desired information. In Information retrieval Document Classification is prerequisite. In practice many classification techniques were and are in use. Term Frequency-Inverse Document Frequency (TF-IDF) is an approach which represents documents based on the frequency of terms in documents. Limitation of this approach is high dimensionality of data. Moreover it does not consider the relations among the terms, resulting in less precise and noisy end result. In our approach we are using weighted Concept Frequency-Inverse Document Frequency (CF-IDF) with background knowledge of domain Ontology, for classification of RSS feed News Items. Metadata information of news items has been used to assign weight to the identified concepts. No trained classifiers are required as Ontology itself acts as a classifier. We have designed ontology based on news industry standards. This classification approach considers relations among the concepts and properties. It results in reduction of noise in final output. It considers only the key concepts of a domain for classification instead of all the terms, which curbs the problem of dimensionality. Evaluation of experimental results reveals that proposed approach gives better classification results.
Keywords :
Internet; document handling; information retrieval; meta data; ontologies (artificial intelligence); pattern classification; CF-IDF; RSS feed news item classification; TF-IDF; Web demand techniques; concept frequency inverse document frequency; data dimensionality; information retrieval document classification; metadata information; ontology; term frequency-inverse document frequency; Decision support systems; Films; Helium; Intelligent systems; CF-IDF; News Domain Ontology; RSS news feeds; Semantic classification;
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2012 12th International Conference on
Conference_Location :
Kochi
Print_ISBN :
978-1-4673-5117-1
DOI :
10.1109/ISDA.2012.6416587