• DocumentCode
    2665296
  • Title

    Automatic keywords extraction of Chinese document using small world structure

  • Author

    Mengxiao, Zhu ; Zhi, Cai ; Qingsheng, Cai

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2003
  • fDate
    26-29 Oct. 2003
  • Firstpage
    438
  • Lastpage
    443
  • Abstract
    Small world structure characterized by short characteristic path length and high clustering coefficient is widely observed in many natural and man-made systems. Inspired by the small word structure found in English, we analyzed Chinese documents and construct cooccurrence networks to representing the correlations of words, where nodes are words and the cooccurrences of words in the same sentence make up links. The cooccurrence networks show obvious small world properties. By utilizing the impact of each node´s absence on the characteristic path length of the network, we extract keywords, which can well represent the contents of the document.
  • Keywords
    natural languages; word processing; Chinese document; automatic keyword extraction; cooccurrence network; word correlation; Biological system modeling; Books; Bridges; Character generation; Computer science; Humans; Mesh generation; Natural languages; Power grids; Social network services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7803-7902-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2003.1275946
  • Filename
    1275946