DocumentCode :
3622709
Title :
Using DMoz for constructing ontology from data stream
Author :
M. Grobelnik;J. Brank;D. Mladenic;B. Novak;B. Fortuna
Author_Institution :
Jozef Stefan Inst., Ljubljana
fYear :
2006
fDate :
6/28/1905 12:00:00 AM
Firstpage :
439
Lastpage :
444
Abstract :
This paper presents an approach for constructing an ontology from a stream of documents. Named entities extracted from the documents are used as instances of the ontology. Entities and co-occurring entity pairs are represented by feature vectors based on the content of the documents where they occurred. In general, concepts and relations can be formed into an ontological structure either by clustering or by classification into an existing topic hierarchy. We propose the latter using DMoz as an existing topic hierarchy. The approach is efficient and can scale to large data sets. We propose a framework that incorporates the stream mining process into a formal definition of the ontology. We describe a software component implementing this approach, and present experiments using a large collection of news
Keywords :
"Ontologies","Data mining","Data processing","Hardware","Companies","Algorithm design and analysis","Sensor systems"
Publisher :
ieee
Conference_Titel :
Information Technology Interfaces, 2006. 28th International Conference on
ISSN :
1330-1012
Print_ISBN :
953-7138-05-4
Type :
conf
DOI :
10.1109/ITI.2006.1708521
Filename :
1708521
Link To Document :
بازگشت