DocumentCode :
1125569
Title :
Recognition of Noisy Subsequences Using Constrained Edit Distances
Author :
Oommen, B. John
Author_Institution :
School of Computer Science, Carleton University, Ottawa, Ont. KIS 5B6, Canada.
Issue :
5
fYear :
1987
Firstpage :
676
Lastpage :
685
Abstract :
Let X* be any unknown word from a finite dictionary H. Let U be any arbitrary subsequence of X*. We consider the problem of estimating X* by processing Y, which is a noisy version of U. We do this by defining the constrained edit distance between XH and Y subject to any arbitrary edit constraint involving the number and type of edit operations to be performed. An algorithm to compute this constrained edit distance has been presented. Although in general the algorithm has a cubic time complexity, within the framework of our solution the algorithm possesses a quadratic time complexity. Recognition using the constrained edit distance as a criterion demonstrates remarkable accuracy. Experimental results which involve strings of lengths between 40 and 80 and which contain an average of 26.547 errors per string demonstrate that the scheme has about 99.5 percent accuracy.
Keywords :
Character recognition; Computer science; Councils; Dictionaries; Constrained editing; Levenshtein metric; String correction; subsequence correction; substring correction;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.1987.4767962
Filename :
4767962
Link To Document :
بازگشت