DocumentCode
3488336
Title
Mixed Thai-English Character Classification Based on Histogram of Oriented Gradient Feature
Author
Siriteerakul, Teera
Author_Institution
Fac. of Sci., King Mongkut´s Inst. of Technol. Ladkrabang, Bangkok, Thailand
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
847
Lastpage
851
Abstract
The task of classifying mixed Thai-English characters carries considerable challenges due to the number and complexity of the characters. This paper proposes and empirically investigates the performance of a classification system that uses Histogram of Oriented Gradient as an image feature with Support Vector Machine as a classification tool. The experiments were done on the datasets provided by NECTEC which consists of over 600,000 printed images of individual characters from 142 distinct classes. With this proposed method, an accuracy of 97% can be achieved without a look up dictionary or any post-processing system.
Keywords
character recognition; image classification; natural language processing; support vector machines; NECTEC; classification tool; histogram of oriented gradient feature; mixed Thai-English character classification; support vector machine; Accuracy; Character recognition; Feature extraction; Histograms; Support vector machines; Training; Vectors; Character classification; Histogram of Oriented Gradient; Thai OCR;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.173
Filename
6628738
Link To Document