• DocumentCode
    2959921
  • Title

    Constructing Corpus for Query-Oriented XML Text Summarization

  • Author

    Wu, Shihan ; Liu, Dexi ; Jiao, Xianpei

  • Author_Institution
    Jiangxi Key Lab. of Data & Knowledge Eng., Jiangxi Univ. of Finance & Econ., Nanchang, China
  • fYear
    2010
  • fDate
    23-24 Oct. 2010
  • Firstpage
    45
  • Lastpage
    49
  • Abstract
    XML retrieval is becoming a focus of study in the fields of information retrieval and databases. Summarizing the results returned by XML search engines will ease the user's reading burden. However, the construction of a query-oriented XML text summarization corpus, which is the basis of such work, has not yet received enough attention. In this paper, we introduce our work on constructing this kind of corpus, including the selection of topics and XML elements/documents, the construction process, and the features of the constructed corpus. So far, the corpus has 25 English query topics, including 422 elements for summarization, and 32 Chinese topics, including 402 elements. For each topic, 4 extracted summaries and 4 generated summaries were made manually by 4 experts.
  • Keywords
    XML; query processing; search engines; text analysis; Chinese topics; English query topics; XML documents; XML elements; XML retrieval; XML search engine; corpus construction; information retrieval; query-oriented XML text summarization; result summarization; Databases; Education; Feature extraction; Machine learning; Pragmatics; Security; XML; Automatic Summarization; Corpus; Query-oriented; XML;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Management of e-Commerce and e-Government (ICMeCG), 2010 Fourth International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4244-8507-9
  • Type

    conf

  • DOI
    10.1109/ICMeCG.2010.18
  • Filename
    5628629