DocumentCode :
3488336
Title :
Mixed Thai-English Character Classification Based on Histogram of Oriented Gradient Feature
Author :
Siriteerakul, Teera
Author_Institution :
Fac. of Sci., King Mongkut´s Inst. of Technol. Ladkrabang, Bangkok, Thailand
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
847
Lastpage :
851
Abstract :
The task of classifying mixed Thai-English characters carries considerable challenges due to the number and complexity of the characters. This paper proposes and empirically investigates the performance of a classification system that uses Histogram of Oriented Gradient as an image feature with Support Vector Machine as a classification tool. The experiments were done on the datasets provided by NECTEC which consists of over 600,000 printed images of individual characters from 142 distinct classes. With this proposed method, an accuracy of 97% can be achieved without a look up dictionary or any post-processing system.
Keywords :
character recognition; image classification; natural language processing; support vector machines; NECTEC; classification tool; histogram of oriented gradient feature; mixed Thai-English character classification; support vector machine; Accuracy; Character recognition; Feature extraction; Histograms; Support vector machines; Training; Vectors; Character classification; Histogram of Oriented Gradient; Thai OCR;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.173
Filename :
6628738
Link To Document :
بازگشت