• DocumentCode
    2508577
  • Title
    Optimizations for query index updating in finite automaton based XML stream filtering systems
  • Author
    Qin, Yongrui ; Sun, Weiwei ; Yu, Ping ; Zhang, Zhuoyao
  • Author_Institution
    Dept. of Comput. & Inf. Technol., Fudan Univ., Shanghai
  • fYear
    2008
  • fDate
    8-11 July 2008
  • Firstpage
    509
  • Lastpage
    514
  • Abstract
    XML stream filtering is one of the most popular topics in XML research. Many XML stream filtering systems are based on finite automata (FA). Such systems have proven to offer high performance and scalability when matching XML-encoded streams against large numbers of queries. The filtering engine is the most important component of a stream filtering system. Single and bulk updating approaches for the query index of the filtering engine have been studied in previous work. In this paper, we optimize the previous updating techniques and propose to actually delete useless states immediately after updating the query index to improve filtering performance. We also design a hybrid query index structure that performs the insertions and deletions of newly arriving queries simultaneously to further reduce the updating cost. Our preliminary experiments show that our approaches provide significantly better scalability and updating performance than existing approaches. Pruning useless states also improves filtering performance.
  • Keywords
    XML; finite automata; information filtering; optimisation; query processing; XML stream filtering systems; XML-encoded stream; filtering engine; finite automaton; query index updating optimizations; Automata; Costs; Doped fiber amplifiers; Information filtering; Information filters; Internet; Matched filters; Scalability; Search engines; XML;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Information Technology, 2008. CIT 2008. 8th IEEE International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-1-4244-2357-6
  • Electronic_ISBN
    978-1-4244-2358-3
  • Type
    conf
  • DOI
    10.1109/CIT.2008.4594727
  • Filename
    4594727