Title :
Extracting domain knowledge by complex networks analysis of Wikipedia entries
Author :
Matas, Neven ; Martincic-Ipsic, Sanda ; Mestrovic, Ana
Author_Institution :
Fac. of Humanities & Social Sci., Univ. of Rijeka, Rijeka, Croatia
Abstract :
In this paper we describe a complex networks analysis of Wikipedia. We construct 10 different networks from Wikipedia entries (articles) related to the chosen domain. The goal of the experiment is to extract domain knowledge by identifying entries that are centrally positioned and entries that are strongly related. We apply complex networks analysis to all acquired networks and examine the networks' structure. We employ centrality measures to find centrally positioned entries in each network. Furthermore, we identify communities and determine which entries are densely connected according to the network structure.
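The idea of finding centrally positioned entries can be illustrated with a minimal sketch. The abstract does not say which centrality measures or tools were used, so the two measures below (degree and closeness), the toy link data, and all function names are assumptions chosen only for illustration. Links are treated as undirected, which is also an assumption.

```python
from collections import deque

# Hypothetical toy data: entry title -> titles it links to.
# These are NOT the networks built in the paper.
links = {
    "Python": ["Programming language", "Java", "Compiler"],
    "Java": ["Programming language", "Compiler"],
    "Programming language": ["Compiler"],
    "Compiler": [],
    "Haskell": ["Programming language"],
}

def build_undirected(links):
    """Turn the link lists into an undirected adjacency map of sets."""
    adj = {}
    for src, targets in links.items():
        adj.setdefault(src, set())
        for dst in targets:
            adj.setdefault(dst, set())
            adj[src].add(dst)
            adj[dst].add(src)
    return adj

def degree_centrality(adj):
    """Fraction of the other nodes that each node is directly linked to."""
    n = len(adj)
    return {v: len(neighbours) / (n - 1) for v, neighbours in adj.items()}

def closeness_centrality(adj):
    """Reachable-node count divided by the sum of shortest-path distances."""
    result = {}
    for source in adj:
        # Breadth-first search gives shortest path lengths in an unweighted graph.
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total else 0.0
    return result

adj = build_undirected(links)
degree = degree_centrality(adj)
closeness = closeness_centrality(adj)

# Rank entries from most to least central by degree.
ranking = sorted(degree, key=degree.get, reverse=True)
```

In this toy graph, `ranking[0]` is the entry linked to every other entry. Community detection, the second step mentioned in the abstract, is not covered by this sketch.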
Keywords :
Web sites; Wikipedia entries; centrality measures; complex networks analysis; domain knowledge extraction; network structure; Communities; Complex networks; Computer languages; Electronic publishing; Encyclopedias; Internet;
Conference_Title :
Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2015 38th International Convention on
Conference_Location :
Opatija
DOI :
10.1109/MIPRO.2015.7160531