• DocumentCode
    2005116
  • Title

    Properties of language networks in Japanese Wikipedia

  • Author

    Sato, Hikaru ; Kubo, Momoji ; Namatame, Akira

  • Author_Institution
    Dept. of Comput. Sci., Nat. Defense Acad., Yokosuka, Japan
  • fYear
    2012
  • fDate
    20-24 Nov. 2012
  • Firstpage
    281
  • Lastpage
    284
  • Abstract
    Linguistic activity is highly complicated things that is produced from human brain. When the topic which is written or spoken becomes difficult, the produced sentence and article become more complex. Traditional analysis of the linguistic activity was based on the word frequency in use. Recently, the analysis based on the relation between word usage is attracting attention. These relation can be represented by network called “language networks.” Many findings from the research of complex networks can be applied to this area. In this study, we investigate cooccurrence networks that are made from Wikipedia´s article. Several network indices are used to classify the co-occurrence networks. We found that the co-occurrence networks made from the similar categories show the similarities in terms of indices.
  • Keywords
    Web sites; complex networks; linguistics; natural language processing; pattern classification; text analysis; Japanese wikipedia; Wikipedia article; complex networks; cooccurrence network classification; language networks; linguistic activity; word usage;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
  • Conference_Location
    Kobe
  • Print_ISBN
    978-1-4673-2742-8
  • Type

    conf

  • DOI
    10.1109/SCIS-ISIS.2012.6505197
  • Filename
    6505197