DocumentCode
2183807
Title
Effective summarization method of text documents
Author
Alguliev, Rasim M. ; Aliguliyev, Ramiz M.
Author_Institution
Inst. of Inf. Technol., Azerbaijan Nat. Acad. of Sci., Baku, Azerbaijan
fYear
2005
fDate
19-22 Sept. 2005
Firstpage
264
Lastpage
271
Abstract
In this paper, we propose a text summarization method that creates a summary by computing a relevance score for each sentence and extracting sentences from the original documents. The method takes into account the weight of each sentence in the document. Its essence is to first represent every sentence in the document as a characteristic vector of the words that appear in the document, and then to calculate a relevance score for each sentence. The relevance score of a sentence is determined by comparing it, using the cosine measure, with all the other sentences in the document and with the document title. Before the method is applied, a set of features is defined, and the weight of each word in a sentence is calculated with those features taken into account. The weights of the features that influence the relevance of words are determined using genetic algorithms.
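The scoring step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the sentence-to-sentence and sentence-to-title similarities are combined, so the equal-weight mix (`alpha`), the plain term-frequency vectors, the tokenizer, and all function names here are assumptions made for illustration. The feature weights that the paper tunes with genetic algorithms are left out; the hypothetical `weights` argument only marks where they could plug in.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase and keep alphabetic tokens only (a simplifying assumption).
    cleaned = "".join(c.lower() if c.isalpha() else " " for c in text)
    return cleaned.split()


def vectorize(tokens, weights=None):
    # Term-frequency vector. `weights` is a hypothetical per-word weight map
    # standing in for the feature-based word weights mentioned in the abstract.
    counts = Counter(tokens)
    if weights is None:
        return dict(counts)
    return {w: c * weights.get(w, 1.0) for w, c in counts.items()}


def cosine(u, v):
    # Cosine measure between two sparse vectors stored as dicts.
    dot = sum(u[w] * v[w] for w in u if w in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def relevance_scores(sentences, title, alpha=0.5, weights=None):
    # Score each sentence by its mean cosine similarity to the other
    # sentences, mixed with its similarity to the title.
    # The mixing rule and `alpha` are assumptions, not taken from the paper.
    vecs = [vectorize(tokenize(s), weights) for s in sentences]
    title_vec = vectorize(tokenize(title), weights)
    scores = []
    for i, v in enumerate(vecs):
        others = [cosine(v, u) for j, u in enumerate(vecs) if j != i]
        sim_doc = sum(others) / len(others) if others else 0.0
        scores.append(alpha * sim_doc + (1 - alpha) * cosine(v, title_vec))
    return scores


def summarize(sentences, title, k=1):
    # Extract the k highest-scoring sentences, kept in document order.
    scores = relevance_scores(sentences, title)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in sorted(ranked[:k])]
```

Calling `summarize(sentences, title, k)` on a list of sentence strings returns the `k` sentences that are most similar, under these assumptions, to the title and to the rest of the document.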
Keywords
feature extraction; genetic algorithms; geometry; text analysis; cosine measure; genetic algorithm; relevance score; sentence extraction; text document; text summarization; Data mining; Functional analysis; Genetic algorithms; Graph theory; HTML; Information retrieval; Information technology; Internet; Web pages; World Wide Web
fLanguage
English
Publisher
IEEE
Conference_Titel
Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2415-X
Type
conf
DOI
10.1109/WI.2005.57
Filename
1517852