• Title of article

    Keyword extraction by entropy difference between the intrinsic and extrinsic mode

  • Author/Authors

    Yang, Zhen; Lei, Jianjun; Fan, Kefeng; Lai, Yingxu

  • Issue Information
    Journal with serial issue numbering, year 2013
  • Pages
    9
  • From page
    4523
  • To page
    4531
  • Abstract
    This paper proposes a new metric to evaluate and rank the relevance of words in a text. The method uses the difference in Shannon entropy between the intrinsic and extrinsic modes, which reflects the fact that relevant words are tied to the author's writing intention, i.e., their occurrences are modulated by the author's purpose, while irrelevant words are distributed randomly in the text. Using The Origin of Species by Charles Darwin as a representative text sample, the performance of the detector is demonstrated and compared with previous proposals. Since no reference text corpus is needed, the approach is especially suitable for single documents for which no a priori information is available.
  • Keywords
    Intrinsic mode, Extrinsic mode, Keyword extraction, Entropy difference
  • Journal title
    Physica A: Statistical Mechanics and its Applications
  • Serial Year
    2013
  • Record number

    1737295