Title :
Using tries to eliminate pattern collisions in perfect hashing
Author :
Brain, Marshall D. ; Tharp, Alan L.
Author_Institution :
Dept. of Comput. Sci., North Carolina State Univ., Raleigh, NC, USA
fDate :
4/1/1994
Abstract :
Many current perfect hashing algorithms suffer from the problem of pattern collisions. In this paper, a perfect hashing technique that uses array-based tries and a simple sparse matrix packing algorithm is introduced. This technique eliminates all pattern collisions, and, because of this, it can be used to form ordered minimal perfect hashing functions on extremely large word lists. This algorithm is superior to other known perfect hashing functions for large word lists in terms of function building efficiency, pattern collision avoidance, and retrieval function complexity. It has been successfully used to form an ordered minimal perfect hashing function for the entire 24,481-element Unix word list without resorting to segmentation. The item lists addressed by the perfect hashing function formed can be ordered in any manner, including alphabetically, to easily allow other forms of access to the same list.
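The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a full character trie with an end-of-word sentinel, treats each trie node as one sparse row of a node-by-character matrix, and packs the rows into a single array with a simple first-fit displacement search. All names (`END`, `build_trie`, `pack_rows`, `lookup`) are hypothetical, and the word list is assumed to contain distinct words that do not include the sentinel character.

```python
END = "\0"  # hypothetical end-of-word sentinel, so a word and its prefix stay distinct


def build_trie(words):
    """Return trie rows. Each row maps a character to a child row id,
    or to -(i + 1) when word i of the list ends at that row."""
    rows = [{}]
    for i, word in enumerate(words):
        node = 0
        for ch in word:
            if ch not in rows[node]:
                rows.append({})
                rows[node][ch] = len(rows) - 1
            node = rows[node][ch]
        rows[node][END] = -(i + 1)
    return rows


def pack_rows(rows):
    """First-fit packing of the sparse rows into one shared array."""
    alphabet = sorted({ch for row in rows for ch in row})
    col = {ch: c for c, ch in enumerate(alphabet)}
    base = [0] * len(rows)   # displacement chosen for each row
    value, owner = [], []    # packed entries and the row that owns each slot
    # Placing the fullest rows first is a common first-fit heuristic.
    for node in sorted(range(len(rows)), key=lambda n: -len(rows[n])):
        cols = [col[ch] for ch in rows[node]]
        if not cols:
            continue
        b = 0
        # Slide the row right until none of its entries hits an occupied slot.
        while any(b + c < len(owner) and owner[b + c] is not None for c in cols):
            b += 1
        grow = b + max(cols) + 1 - len(owner)
        if grow > 0:
            value.extend([0] * grow)
            owner.extend([None] * grow)
        for ch, target in rows[node].items():
            value[b + col[ch]] = target
            owner[b + col[ch]] = node
        base[node] = b
    return col, base, value, owner


def lookup(word, col, base, value, owner):
    """Return the word's index in the original list, or None if absent."""
    if END in word:
        return None
    node = 0
    for ch in word + END:
        c = col.get(ch)
        if c is None:
            return None
        slot = base[node] + c
        # The owner check rejects slots that belong to a different row.
        if slot >= len(owner) or owner[slot] != node:
            return None
        target = value[slot]
        if target < 0:
            return -target - 1
        node = target
    return None


words = sorted(["trie", "tree", "hash", "hashing", "pack", "packing"])
tables = pack_rows(build_trie(words))
print([lookup(w, *tables) for w in words])
```

Because the value stored at the end of each word is simply its position in the input list, the resulting function is minimal and preserves whatever order the list was given in, here alphabetical. In this sketch, two words can never land on the same index, since each word follows its own path through the trie.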
Keywords :
Unix; computational complexity; file organisation; list processing; Unix word list; array-based tries; function building efficiency; item lists; large word lists; ordered minimal perfect hashing function; ordering; pattern collision avoidance; perfect hashing algorithms; retrieval function complexity; sparse array packing; sparse matrix packing algorithm; CD-ROMs; Catalogs; Collision avoidance; Computer science; Data structures; Databases; Dictionaries; Immune system; Information retrieval; Sparse matrices;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on