Title :
On fly search approach for compact XML
Author :
Sathiaseelan, Ruby Carlin Georgewin ; Sitharaman, Sriram ; Subramanian, Raghav Babu ; Senthilkumar, Radha
Author_Institution :
Dept. of Inf. Technol., Anna Univ., Chennai, India
Abstract :
Information Retrieval system produces the result in the order of the most relevant to the least, for given keywords. The user need to know the exact path of the query in the case of retrieval from an XML document or Compact storage structure, this becomes a hurdle for a novice user and it makes the system suitable only for experts. The On Fly Search (OFS) method has been proposed to make the system suitable for all the users and thus helps the users to search the compact storage structures without any knowledge about the content or about the path of the query. It also extends to support the auto complete method for multiple keyword queries. The typographical errors in the query are removed by the usage of fuzzy logic techniques. The effective Indexing Structure in QUICX helps to retrieve the data efficiently from the compact storage structure. The radix trie data structure, ranking function and inverted indexing has been used to have effective on fly search and to retrieve the top k results. The experiments are carried out on standard bench mark datasets like Shakespeare dataset, the results shows that the proposed method helps to retrieve the top-k results for the user query comparatively better than the existing approaches.
Keywords :
XML; data structures; document handling; fuzzy set theory; indexing; information retrieval systems; query processing; OFS method; QUICX indexing structure; Shakespeare dataset; XML document; compact XML; compact storage structure; extensible markup language; fuzzy logic techniques; information retrieval system; inverted indexing; on fly search method; query path; radix trie data structure; ranking function; Data structures; Indexes; Information technology; Keyword search; Market research; XML; XML; compact storage structure; fuzzy search; inverted index; radix trie;
Conference_Titel :
Recent Trends in Information Technology (ICRTIT), 2013 International Conference on
Conference_Location :
Chennai
DOI :
10.1109/ICRTIT.2013.6844228