DocumentCode
1125804
Title
A High Speed String Correction Method Using a Hierarchical File
Author
Tanaka, Eiichi ; Kojima, Yurie
Author_Institution
Department of Information Science, Faculty of Engineering, Utsunomiya University, Utsunomiay 321, Japan.
Issue
6
fYear
1987
Firstpage
806
Lastpage
815
Abstract
This paper describes a high speed string correction method using a hierarchical file. After reviewing a string correction method based on the Levenshtein distance, a hierarchical file construction method is introduced. A multistage string correction method using this file is proposed. The lower bound of computational complexity is estimated, and it is shown that a multistage method using a special type of a hierarchical file can reduce computational labor greatly. The larger the number of strings considered is, the more efficient the method becomes. The results of computer simulations on 5374 phoneme sequences using two and three stage correction methods are stated. The condition for a multistage string correction method to obtain higher correction rates than an ordinary dictionary method is included.
Keywords
Biology; Computational complexity; Computer simulation; Dictionaries; Error correction; Sequences; Speech processing; Speech recognition; Statistical analysis; Vocabulary; Deletion; Levenshtein; hierarchical file; insertion; speech recognition; string comparison; string edit; substitution;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.1987.4767987
Filename
4767987
Link To Document