DocumentCode
124355
Title
A scalable XML indexing method using MapReduce
Author
Wen-Chiao Hsu ; Hsiao-Chen Shih ; I-En Liao
Author_Institution
Dept. of Comput. Sci. & Eng., Nat. Chung-Hsing Univ., Taichung, Taiwan
fYear
2014
fDate
13-15 Aug. 2014
Firstpage
81
Lastpage
86
Abstract
With the advent of the big data era, cloud computing has become one of the promising solutions. Many theories and methods that were originally designed for stand-alone computers must be re-examined for their applicability in the cloud. For example, most of the XML indexing methods discussed in the literature are suited to processing small XML files on a stand-alone computer; when they deal with a large XML document, they run into memory shortages. In this paper, we redesign CIS-X (A Compressed Index Scheme for Efficient Query Evaluation of XML Documents), an XML indexing method developed by our research group, using MapReduce as implemented in Hadoop to handle large XML documents through parallel computing in the cloud. The proposed cloud-based CIS-X can be applied to any XML file without requiring a DTD or schema.
Keywords
Big Data; XML; file organisation; indexing; parallel programming; public domain software; query processing; Big Data; Hadoop; MapReduce; cloud parallel computing; cloud-based CIS-X; compressed index scheme for efficient query evaluation of XML documents; memory shortage problem; scalable XML indexing method; small XML file processing; stand-alone computer; Cloud computing; Encoding; Indexing; Parallel processing; Query processing; XML; CIS-X; Hadoop; MapReduce; XML indexing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Innovative Computing Technology (INTECH), 2014 Fourth International Conference on
Conference_Location
Luton
Type
conf
DOI
10.1109/INTECH.2014.6927757
Filename
6927757