Title :
A Fuzzy-Rough Hybrid Approach to Multi-document Extractive Summarization
Author :
Huang, Hsun-Hui ; Yang, Horng-Chang ; Kuo, Yau-Hwang
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ, Tainan, Taiwan
Abstract :
To generate a multi-document extractive summary,the measurement of sentence relevance is of vital importance.Earlier work, exploring statistics of textual terms at the word(surface) level, faces the problem that the textual terms may be synonymous or ploysemous. This may lead to misrank sentence relevance and may cause redundant information presented in the generated summary. Furthermore, the relationships between concepts expressed by natural languages are inherently fuzzy,which invites the use of fuzzy set and rough set theory. In this paper, we investigate some sentence features from a concept level space and apply a fuzzy-rough hybrid scheme to define a sentence relevance measure. Our approach is applied to the DUC2006 multi-document summarization tasks. The experimental results show our approach is promising and demonstrate the effectiveness of fuzzy set and rough set theory in the application of text summarization.
Keywords :
fuzzy set theory; natural languages; rough set theory; text analysis; fuzzy-rough hybrid approach; multidocument extractive summarization; multidocument extractive summary; natural language; textual term; Computer science; Data mining; Explosions; Extraterrestrial measurements; Fuzzy set theory; Hybrid intelligent systems; Management information systems; Natural languages; Set theory; Statistics; fuzzy-rough hybrid; sense-based fuzzy set; sentence feature vector;
Conference_Titel :
Hybrid Intelligent Systems, 2009. HIS '09. Ninth International Conference on
Conference_Location :
Shenyang
Print_ISBN :
978-0-7695-3745-0
DOI :
10.1109/HIS.2009.41