DocumentCode
2638299
Title
Research of massive heterogeneous data integration based on Lucene and XQuery
Author
Tianyuan, Liu ; Meina, Song ; Xiaoqi, Zhang
Author_Institution
Sch. of Comput., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2010
fDate
16-17 Aug. 2010
Firstpage
648
Lastpage
652
Abstract
This paper proposes a model for a massive heterogeneous data integration system based on Lucene and XQuery. The model shields the distribution and heterogeneity of resources and achieves transparent access using materialized database views. Query efficiency is improved by a highly effective categorization algorithm that segments the data into an index built with the open source tool Lucene. Further, the model takes full advantage of XQuery, which can process both structured and non-structured data, to address the significant differences among data sources as well as the efficiency of massive data access.
Keywords
data handling; information retrieval; public domain software; query languages; search engines; XQuery; categorization algorithm; massive data access; massive heterogeneous data integration system; open source tool Lucene; Algorithm design and analysis; Distributed databases; Indexes; Libraries; Query processing; XML; Heterogeneous Data; Lucene; Massive; XQuery;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Society (SWS), 2010 IEEE 2nd Symposium on
Conference_Location
Beijing
Print_ISBN
978-1-4244-6356-5
Type
conf
DOI
10.1109/SWS.2010.5607370
Filename
5607370