Title :
Computation of normalized edit distance and applications
Author :
Marzal, Andrés ; Vidal, Enrique
Author_Institution :
Dept. de Sistemas Inf. y Computacion, Valencia Univ., Spain
fDate :
9/1/1993 12:00:00 AM
Abstract :
Given two strings X and Y over a finite alphabet, the normalized edit distance between X and Y, d(X,Y) is defined as the minimum of W(P)/L(P), where P is an editing path between X and Y, W(P) is the sum of the weights of the elementary edit operations of P, and L(P) is the number of these operations (length of P). It is shown that in general, d(X ,Y) cannot be computed by first obtaining the conventional (unnormalized) edit distance between X and Y and then normalizing this value by the length of the corresponding editing path. In order to compute normalized edit distances, an algorithm that can be implemented to work in O(m×n2) time and O( n2) memory space is proposed, where m and n are the lengths of the strings under consideration, and m ⩾n. Experiments in hand-written digit recognition are presented, revealing that the normalized edit distance consistently provides better results than both unnormalized or post-normalized classical edit distances
Keywords :
computational complexity; pattern recognition; character strings; finite alphabet; hand-written digit recognition; normalized edit distance; words; Character recognition; Error correction; Optical character recognition software; Pattern recognition; Speech recognition;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on