• DocumentCode
    3342965
  • Title

    Hybrid Parallelism for XML SAX Parsing

  • Author

    Pan, Yinfei ; Zhang, Ying ; Chiu, Kenneth

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York, Binghamton, NY
  • fYear
    2008
  • fDate
    23-26 Sept. 2008
  • Firstpage
    505
  • Lastpage
    512
  • Abstract
    XML has been widely adopted across a wide spectrum of applications. Its parsing efficiency, however, remains a concern, and can be a bottleneck. At the same time, with the trend towards multicore CPUs, parallelization to improve performance has become increasingly relevant. In previous work, we have investigated parallelizing DOM-style parsing and gained significant speedup. For streaming XML applications, however, SAX-style parsing is often required. In this paper, we present a technique and implementation of a parallel XML SAX parser. To handle inherent data dependencies in XML while still allowing reasonable scalability, we use a 4-stage software pipeline with a combination of strictly sequential stages and stages that can be further data-parallelized within the stage. We thus utilize a hybrid between pipelined parallelism and data parallelism. To demonstrate effectiveness, we test this approach on a Linux machine with two Intel Xeon L5320 CPUs for a total of 8 physical cores, and obtain good speedup up to about 8 CPUs.
  • Keywords
    XML; parallelising compilers; pipeline processing; 4-stage software pipeline; Linux machine; XML SAX parsing; hybrid parallelism; Application software; Computer science; Concurrent computing; Hardware; Multicore processing; Parallel processing; Scalability; Web services; XML; Yarn; XML; hybrid parallelism; parallel SAX-style parsing; parallel stream processing; parallelism;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Services, 2008. ICWS '08. IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-0-7695-3310-0
  • Electronic_ISBN
    978-0-7695-3310-0
  • Type

    conf

  • DOI
    10.1109/ICWS.2008.107
  • Filename
    4670214