• DocumentCode
    2638299
  • Title

    Research of massive heterogeneous data integration based on Lucene and XQuery

  • Author

    Tianyuan, Liu ; Meina, Song ; Xiaoqi, Zhang

  • Author_Institution
    Sch. of Comput., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2010
  • fDate
    16-17 Aug. 2010
  • Firstpage
    648
  • Lastpage
    652
  • Abstract
    This paper proposes a model of massive heterogeneous data integration system based on Lucene and XQuery. This model shields distribution and heterogeneity of resources and achieves transparent access using materialized view of database. The query efficiency is increased due to the highly effective categorization algorithm to segment data as an index with open source tool Lucene. Further, the model makes full use of the advantage of XQuery, which can process not only structured data but also non-structured data so as to solve the significant difference among various data sources as well as the efficiency of massive data access.
  • Keywords
    data handling; information retrieval; public domain software; query languages; search engines; XQuery; categorization algorithm; massive data access; massive heterogeneous data integration system; open source tool Lucene; Algorithm design and analysis; Distributed databases; Indexes; Libraries; Query processing; XML; Heterogeneous Data; Lucene; Massive; XQuery;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Society (SWS), 2010 IEEE 2nd Symposium on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-6356-5
  • Type

    conf

  • DOI
    10.1109/SWS.2010.5607370
  • Filename
    5607370