DocumentCode :
2005116
Title :
Properties of language networks in Japanese Wikipedia
Author :
Sato, Hikaru ; Kubo, Momoji ; Namatame, Akira
Author_Institution :
Dept. of Comput. Sci., Nat. Defense Acad., Yokosuka, Japan
fYear :
2012
fDate :
20-24 Nov. 2012
Firstpage :
281
Lastpage :
284
Abstract :
Linguistic activity is highly complicated things that is produced from human brain. When the topic which is written or spoken becomes difficult, the produced sentence and article become more complex. Traditional analysis of the linguistic activity was based on the word frequency in use. Recently, the analysis based on the relation between word usage is attracting attention. These relation can be represented by network called “language networks.” Many findings from the research of complex networks can be applied to this area. In this study, we investigate cooccurrence networks that are made from Wikipedia´s article. Several network indices are used to classify the co-occurrence networks. We found that the co-occurrence networks made from the similar categories show the similarities in terms of indices.
Keywords :
Web sites; complex networks; linguistics; natural language processing; pattern classification; text analysis; Japanese wikipedia; Wikipedia article; complex networks; cooccurrence network classification; language networks; linguistic activity; word usage;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
Conference_Location :
Kobe
Print_ISBN :
978-1-4673-2742-8
Type :
conf
DOI :
10.1109/SCIS-ISIS.2012.6505197
Filename :
6505197
Link To Document :
بازگشت