• DocumentCode
    176958
  • Title

    Switch-SSD cache based XML query processing in Hadoop

  • Author

    Changlong Zhou ; Minghua Jiang ; Yu Feng

  • Author_Institution
    Sch. of Math. & Comput., Wuhan Textile Univ., Wuhan, China
  • fYear
    2014
  • fDate
    29-30 Sept. 2014
  • Firstpage
    1131
  • Lastpage
    1134
  • Abstract
    Hadoop as open source software that implements the MapReduce framework is an ideal solution to speed up a XML parallel query processing. We proposed a distributed caching architecture in Hadoop cluster, called switch-SSD which cache XML query results en-route in the network switching nodes. Switch-SSD extends extend OpenFlow switches limited memory space with SSD for caching XML query results in the switch. We design an OpenFlow controller as a cache Manager conducting the switch-SSDs. At the help of the controller, the switch-SSD intercepts the query request and proactively sends the caching results to the client rather than a client conducts cache read operation. By caching the results, switch-SSD reduces calculation of query and lowers the job execution times in Hadoop cluster. Experimental results show that switch-SSD can improve the efficiency of most existing XML parallel query processing in Hadoop cluster.
  • Keywords
    XML; cache storage; parallel processing; public domain software; query processing; Hadoop cluster; MapReduce framework; OpenFlow controller; XML parallel query processing; distributed caching architecture; open source software; switch-SSD cache; Computer architecture; Conferences; Distributed databases; Query processing; Switches; XML; Cache; Hadoop; Switch-SSD; XML parallel query;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Research and Technology in Industry Applications (WARTIA), 2014 IEEE Workshop on
  • Conference_Location
    Ottawa, ON
  • Type

    conf

  • DOI
    10.1109/WARTIA.2014.6976477
  • Filename
    6976477