DocumentCode
2959921
Title
Constructing Corpus for Query-Oriented XML Text Summarization
Author
Wu, Shihan ; Liu, Dexi ; Jiao, Xianpei
Author_Institution
Jiangxi Key Lab. of Data & Knowledge Eng., Jiangxi Univ. of Finance &Econ., Nanchang, China
fYear
2010
fDate
23-24 Oct. 2010
Firstpage
45
Lastpage
49
Abstract
XML Retrieval is becoming the focus study of the field of Information Retrieval and Database. Summarization of the results which come from the XML search engines will alleviate the read burden of user´s. However, as the basis of this study, the construction of the query-oriented XML text summarization corpus has not yet received enough attention. In this paper, we introduce our works on constructing this kind of corpus, including the selection of topics and XML elements/documents, construction process and the feature of the constructed corpus. Up to now, the corpus has 25 English query topics, including 422 elements for summarization, and 32 Chinese topics which including 402 elements. For each topic, 4 pieces of extracted summaries and 4 pieces of generated summaries are made manually by 4 experts.
Keywords
XML; query processing; search engines; text analysis; Chinese topics; English query topics; XML documents; XML elements; XML retrieval; XML search engine; corpus construction; information retrieval; query-oriented XML text summarization; result summarization; Databases; Education; Feature extraction; Machine learning; Pragmatics; Security; XML; Automatic Summarization; Corpus; Query-oriented; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Management of e-Commerce and e-Government (ICMeCG), 2010 Fourth International Conference on
Conference_Location
Chengdu
Print_ISBN
978-1-4244-8507-9
Type
conf
DOI
10.1109/ICMeCG.2010.18
Filename
5628629
Link To Document