Title :
Multi-Document Summarization as Applied in Information Retrieval
Author :
Zhou, Dan ; Li, Lei
Author_Institution :
Center for Intelligence Sci. & Technol. Res., Beijing Univ. of Posts & Telecommun., Beijing
fDate :
Aug. 30 2007-Sept. 1 2007
Abstract :
In this paper we presented the use of multi-document summarization as postprocessing step in information retrieval (IR). We examined the differences between requirements for general multi-document summarization and requirements when it is applied for IR, and highlighted the requirements for clustering and context information extraction, which is much helpful to the users for browsing and searching relative results. To generate this type of summary, we first cluster the retrieved documents by their topics using a repeated bisection algorithm, and extract the centroid words for each cluster. The final summary is generated on the base of the query words and the cluster centroids, containing query-centered information as well as context information.
Keywords :
abstracting; document handling; pattern clustering; query processing; centroid words; clustering requirements; context information extraction; information retrieval; multidocument summarization; query words; repeated bisection algorithm; Clustering algorithms; Data mining; Explosions; Guidelines; Information filtering; Information filters; Information retrieval;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-1610-3
Electronic_ISBN :
978-1-4244-1611-0
DOI :
10.1109/NLPKE.2007.4368034