DocumentCode :
2022453
Title :
Redundant Bit Vectors for Robust Indexing and Retrieval of Electronic Ink
Author :
Chellapilla, Kumar ; Platt, John
Author_Institution :
Microsoft Res., Redmond
Volume :
1
fYear :
2007
fDate :
23-26 Sept. 2007
Firstpage :
387
Lastpage :
391
Abstract :
This paper presents a redundant bit vector approach for indexing and retrieval of handwritten words captured using an electronic pen or tablet. Handwritten words (cursive or print) are first segmented into strokes and each stroke is featurized using a neural network. Oriented principal component analysis (OPCA) is used for dimensionality reduction while ensuring robustness to handwriting variation (noise). Redundant bit vectors are used to index the resulting low dimensional representations for efficient storage and retrieval. Experimental results on large datasets with 898,652 handwritten words show good retrieval performance that is robust to handwriting variations and generalizes well over different writers and writing styles.
Keywords :
indexing; information retrieval; neural nets; principal component analysis; electronic ink retrieval; electronic pen; handwritten words indexing; handwritten words retrieval; neural network; oriented principal component analysis; redundant bit vectors; robust indexing; writing styles; Chebyshev approximation; Handwriting recognition; Indexing; Information retrieval; Ink; Neural networks; Noise robustness; Personal digital assistants; Principal component analysis; Shape;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
ISSN :
1520-5363
Print_ISBN :
978-0-7695-2822-9
Type :
conf
DOI :
10.1109/ICDAR.2007.4378737
Filename :
4378737
Link To Document :
بازگشت