DocumentCode :
1980109
Title :
On the construction of reduced double array structures by dividing tries
Author :
Oono, Masaki ; Fuketa, Masao ; Morita, Kazuhiro ; Aoe, Jun-Ichi
Author_Institution :
Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
500
Abstract :
Speed and storage capacity are major issues in information retrieval systems. In natural language analysis, the double array is a data structure for the trie, a well-known approach to retrieving strings from a dictionary; it combines the fast access of a matrix table with the compactness of a list form. To obtain a more compact structure, this paper presents a compression method that divides the constructed trie into several pieces (blocks). Dividing the trie reduces the number of bits needed to represent the entries of the double arrays. Retrieval in the resulting trie must trace across the blocks, which can slow retrieval because of the state connections between them. To solve this problem, we propose a new trie construction method that compresses the structure and minimizes the number of state connections. Experimental results on a large set of keys show that the storage capacity is reduced to 50% while the retrieval speed remains the same.
Keywords :
data compression; information retrieval systems; natural languages; string matching; tree data structures; IRS; compact structure; data structure; dictionary; information retrieval system speed; matrix table; reduced double array structure construction; state connection compression; state connection minimization; storage capacity; string retrieval; trie division; Computer networks; Data structures; Dictionaries; Hardware; IP networks; Information analysis; Information retrieval; Information science; Natural languages; Tree data structures;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location :
Tucson, AZ
ISSN :
1062-922X
Print_ISBN :
0-7803-7087-2
Type :
conf
DOI :
10.1109/ICSMC.2001.969863
Filename :
969863