Title of article
Generalized Hamming Distance
Author/Authors
Bookstein، Abraham نويسنده , , Kulyukin، Vladimir A. نويسنده , , Raita، Timo نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2002
Pages
-352
From page
353
To page
0
Abstract
Many problems in information retrieval and related fields depend on a reliable measure of the distance or similarity between objects that, most frequently, are represented as vectors. This paper considers vectors of bits. Such data structures implement entities as diverse as bitmaps that indicate the occurrences of terms and bitstrings indicating the presence of edges in images. For such applications, a popular distance measure is the Hamming distance. The value of the Hamming distance for information retrieval applications is limited by the fact that it counts only exact matches, whereas in information retrieval, corresponding bits that are close by can still be considered to be almost identical. We define a “Generalized Hamming distance” that extends the Hamming concept to give partial credit for near misses, and suggest a dynamic programming algorithm that permits it to be computed efficiently. We envision many uses for such a measure. In this paper we define and prove some basic properties of the “Generalized Hamming distance”, and illustrate its use in the area of object recognition. We evaluate our implementation in a series of experiments, using autonomous robots to test the measureʹs effectiveness in relating similar bitstrings.
Keywords
robot vision , information retrieval , hamming distance , computer vision , Metrics , Object recognition , Image retrieval
Journal title
INFORMATION RETRIEVAL
Serial Year
2002
Journal title
INFORMATION RETRIEVAL
Record number
89804
Link To Document