Title :
A scalable XML indexing method using MapReduce
Author :
Wen-Chiao Hsu ; Hsiao-Chen Shih ; I-En Liao
Author_Institution :
Dept. of Comput. Sci. & Eng., Nat. Chung-Hsing Univ., Taichung, Taiwan
Abstract :
With the advent of the era of big data, cloud computing technology is one of the promising solutions. Many theories and methods, which are originally designed for stand-alone computer, must be re-examined for the applicability in the cloud. For example, most of the XML indexing methods discussed in the literature are suitable for processing small XML files by stand-alone computer. When they deal with a large XML document, memory shortage problem will be encountered. In this paper, we redesign an XML indexing method, called CIS-X (A Compressed Index Scheme for Efficient Query Evaluation of XML Documents) that is developed by our research group, using MapReduce implemented in Hadoop to handle large XML documents through cloud parallel computing. The proposed cloud-based CIS-X can be applied to any XML file without DTD or schema.
Keywords :
Big Data; XML; file organisation; indexing; parallel programming; public domain software; query processing; Big Data; Hadoop; MapReduce; cloud parallel computing; cloud-based CIS-X; compressed index scheme for efficient query evaluation of XML documents; memory shortage problem; scalable XML indexing method; small XML file processing; stand-alone computer; Cloud computing; Encoding; Indexing; Parallel processing; Query processing; XML; CIS-X; Hadoop; MapReduce; XML indexing;
Conference_Titel :
Innovative Computing Technology (INTECH), 2014 Fourth International Conference on
Conference_Location :
Luton
DOI :
10.1109/INTECH.2014.6927757