DocumentCode
2005116
Title
Properties of language networks in Japanese Wikipedia
Author
Sato, Hikaru ; Kubo, Momoji ; Namatame, Akira
Author_Institution
Dept. of Comput. Sci., Nat. Defense Acad., Yokosuka, Japan
fYear
2012
fDate
20-24 Nov. 2012
Firstpage
281
Lastpage
284
Abstract
Linguistic activity is highly complicated things that is produced from human brain. When the topic which is written or spoken becomes difficult, the produced sentence and article become more complex. Traditional analysis of the linguistic activity was based on the word frequency in use. Recently, the analysis based on the relation between word usage is attracting attention. These relation can be represented by network called “language networks.” Many findings from the research of complex networks can be applied to this area. In this study, we investigate cooccurrence networks that are made from Wikipedia´s article. Several network indices are used to classify the co-occurrence networks. We found that the co-occurrence networks made from the similar categories show the similarities in terms of indices.
Keywords
Web sites; complex networks; linguistics; natural language processing; pattern classification; text analysis; Japanese wikipedia; Wikipedia article; complex networks; cooccurrence network classification; language networks; linguistic activity; word usage;
fLanguage
English
Publisher
ieee
Conference_Titel
Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
Conference_Location
Kobe
Print_ISBN
978-1-4673-2742-8
Type
conf
DOI
10.1109/SCIS-ISIS.2012.6505197
Filename
6505197
Link To Document