• DocumentCode
    2183807
  • Title

    Effective summarization method of text documents

  • Author

    Alguliev, Rasim M. ; Aliguliyev, Ramiz M.

  • Author_Institution
    Inst. of Inf. Technol., Azerbaijan Nat. Acad. of Sci., Baku, Azerbaijan
  • fYear
    2005
  • fDate
    19-22 Sept. 2005
  • Firstpage
    264
  • Lastpage
    271
  • Abstract
    In this paper, we propose text summarization method that creates text summary by definition of the relevance score of each sentence and extracting sentences from the original documents. This summarization method takes into account the weight of each sentence in the document. The essence of the method suggested is in preliminary identification of every sentence in the document with characteristic vector of words, which appear in the document, and calculation of relevance score for each sentence. The relevance score of sentence is determined through its comparison with all the other sentences in the document and with the document title by cosine measure. Prior to application of this method, the scope of features is defined and then the weight of each word in the sentence is calculated with account of those features. The weights of features, influencing relevance of words, are determined using genetic algorithms.
  • Keywords
    feature extraction; genetic algorithms; geometry; text analysis; cosine measure; genetic algorithm; relevance score; sentence extraction; text document; text summarization; Data mining; Functional analysis; Genetic algorithms; Graph theory; HTML; Information retrieval; Information technology; Internet; Web pages; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2415-X
  • Type

    conf

  • DOI
    10.1109/WI.2005.57
  • Filename
    1517852