DocumentCode :
2020827
Title :
Learning to Group Text Lines and Regions in Freeform Handwritten Notes
Author :
Ye, Ming ; Viola, Paul ; Raghupathy, Sashi ; Sutanto, Herry ; Li, Chengyang
Author_Institution :
Microsoft Corp., Redmond
Volume :
1
fYear :
2007
fDate :
23-26 Sept. 2007
Firstpage :
28
Lastpage :
32
Abstract :
This paper proposes a machine learning approach to grouping problems in ink parsing. Starting from an initial segmentation, hypotheses are generated by perturbing local configurations and processed in a high-confidence-first fashion, where the confidence of each hypothesis is produced by a data-driven AdaBoost decision-tree classifier with a set of intuitive features. This framework has successfully applied to grouping text lines and regions in complex freeform digital ink notes from real TabletPC users. It holds great potential in solving many other grouping problems in the ink parsing and document image analysis domains.
Keywords :
decision trees; document image processing; group theory; handwritten character recognition; image classification; image segmentation; ink; learning (artificial intelligence); program compilers; text analysis; AdaBoost decision-tree classifier; TabletPC; document image analysis; freeform digital ink notes; freeform handwritten notes; group text lines; group text regions; ink parsing; machine learning; text segmentation; Image analysis; Image converters; Image segmentation; Ink; Iterative algorithms; Machine learning; Merging; Paper technology; Text analysis; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
ISSN :
1520-5363
Print_ISBN :
978-0-7695-2822-9
Type :
conf
DOI :
10.1109/ICDAR.2007.4378670
Filename :
4378670
Link To Document :
بازگشت