DocumentCode
3230789
Title
ParaParse: A parallel method for XML parsing
Author
Chen, Rongxin ; Liao, Husheng
Author_Institution
Coll. of Comput. Sci., Beijing Univ. of Technol., Beijing, China
fYear
2011
fDate
27-29 May 2011
Firstpage
81
Lastpage
85
Abstract
Full manipulation of XML data has to rely on parsing process before hand. XML parsing is CPU intensive and tends to greatly affect the performance of XML application in a general way. Moreover, parallel computing is widely introduced to solve practical problems due to the popularization of multi-core computers. It is a natural and promising way to parallelize parsing process. Existing parallel parsing methods need pre-parsing stage to get proper data partitions. Unfortunately pre-parsing is often time consuming and difficult to be optimized. ParaParse presented in this paper is a novel parallel method for XML parsing. It has a rather light weighted data partition way and supports parsing arbitrarily partitioned XML segments in parallel. After that subtree merging is carried out to generate global XML tree. The parsing result can be further wrapped for sophisticated XML query. Experiment results show that ParaParse is suited to multi-core environment to realize parallel XML parsing.
Keywords
XML; multiprocessing systems; parallel processing; program compilers; CPU intensive; ParaParse; XML data; XML parsing; XML segments; data partitions; multicore computers; parallel computing; parallel method; parallelize parsing process; Prefetching; Redundancy; XML; XML parsing; multi-core; parallel computing;
fLanguage
English
Publisher
ieee
Conference_Titel
Communication Software and Networks (ICCSN), 2011 IEEE 3rd International Conference on
Conference_Location
Xi´an
Print_ISBN
978-1-61284-485-5
Type
conf
DOI
10.1109/ICCSN.2011.6014223
Filename
6014223
Link To Document