DocumentCode :
723476
Title :
Extracting domain knowledge by complex networks analysis of Wikipedia entries
Author :
Matas, Neven ; Martincic-Ipsic, Sanda ; Mestrovic, Ana
Author_Institution :
Fac. of Humanities & Social Sci., Univ. of Rijeka, Rijeka, Croatia
fYear :
2015
fDate :
25-29 May 2015
Firstpage :
1622
Lastpage :
1627
Abstract :
In this paper we describe a complex networks analysis of Wikipedia. We construct 10 different networks from Wikipedia entries (articles) related to the chosen domain. The goal of the experiment is to extract domain knowledge by identifying entries that are centrally positioned and entries that are strongly related. We apply complex networks analysis to all acquired networks and examine the networks' structure. We employ centrality measures to find centrally positioned entries in the network. Furthermore, we identify communities and find which entries are densely connected according to the network structure.
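As an illustration of the centrality step the abstract mentions, the following is a minimal sketch, not the authors' implementation. The toy link graph, the entry titles, and the function names are all hypothetical, and the choice of degree and closeness centrality is an assumption, since the abstract does not name the specific measures used. Community detection is left out to keep the example short.

```python
from collections import deque

# Hypothetical toy data: each entry title maps to the entries it links to.
# These titles and links are illustrative only, not taken from the paper.
links = {
    "Python": ["Programming language", "Guido van Rossum"],
    "Java": ["Programming language", "Compiler"],
    "C": ["Programming language", "Compiler"],
    "Programming language": ["Compiler"],
    "Guido van Rossum": [],
    "Compiler": [],
}


def build_undirected(links):
    """Turn the link lists into an undirected adjacency map (assumed simplification)."""
    graph = {}
    for source, targets in links.items():
        graph.setdefault(source, set())
        for target in targets:
            graph.setdefault(target, set())
            graph[source].add(target)
            graph[target].add(source)
    return graph


def degree_centrality(graph):
    """Fraction of the other nodes that each node is directly connected to."""
    n = len(graph)
    return {node: len(neighbours) / (n - 1) for node, neighbours in graph.items()}


def closeness_centrality(graph):
    """Reachable-node count divided by the sum of shortest-path distances (BFS)."""
    result = {}
    for start in graph:
        dist = {start: 0}
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for neighbour in graph[node]:
                if neighbour not in dist:
                    dist[neighbour] = dist[node] + 1
                    queue.append(neighbour)
        total = sum(dist.values())
        result[start] = (len(dist) - 1) / total if total else 0.0
    return result


graph = build_undirected(links)
degree = degree_centrality(graph)
closeness = closeness_centrality(graph)
most_central = max(degree, key=degree.get)
print(most_central, round(degree[most_central], 2))
```

In this toy graph the entry with the most direct links also has the highest closeness, which is the kind of "centrally positioned" entry the abstract refers to.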
Keywords :
Web sites; Wikipedia entries; centrality measures; complex networks analysis; domain knowledge extraction; network structure; Communities; Complex networks; Computer languages; Electronic publishing; Encyclopedias; Internet;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2015 38th International Convention on
Conference_Location :
Opatija
Type :
conf
DOI :
10.1109/MIPRO.2015.7160531
Filename :
7160531