DocumentCode :
2012109
Title :
Similarity Evaluation and Shape Feature Extraction for Character Pattern Retrieval to Support Reading Historical Documents
Author :
Kitadai, Akihito ; Nakagawa, Masaki ; Baba, Hajime ; Watanabe, Akihiro
Author_Institution :
J.F. Oberlin Univ., Tokyo, Japan
fYear :
2012
fDate :
27-29 March 2012
Firstpage :
359
Lastpage :
363
Abstract :
Many historical documents written more than 1,000 years ago survive today. The shape features of character patterns in these documents are unstable or missing because most of the documents are heavily stained and degraded. Digital archives of the documents with accurate character pattern retrieval methods are helpful for archaeologists and historians. In this paper, we propose a similarity evaluation method for character patterns with missing shape parts. It works together with non-linear normalization for such patterns and efficiently modifies the templates for each retrieval trial. In experiments using 4,911 Kanji (Chinese-origin) character patterns from Japanese historical documents called mokkans, the method improves retrieval accuracy. We also present a simple implementation of gradient feature extraction in order to compare the chain code feature with the gradient feature in retrieval. As a result, the gradient feature works better than the chain code feature.
Keywords :
document image processing; feature extraction; gradient methods; history; image retrieval; records management; Japanese historical documents; chain code feature; character pattern retrieval; digital archives; gradient feature; mokkans; nonlinear normalization; shape feature extraction; support reading historical documents; Accuracy; Electronic mail; Feature extraction; Histograms; Humans; Image reconstruction; Shape; character pattern retrieval; gradient feature; historical documents; mokkan;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location :
Gold Coast, QLD
Print_ISBN :
978-1-4673-0868-7
Type :
conf
DOI :
10.1109/DAS.2012.80
Filename :
6195394