DocumentCode :
735861
Title :
An optimal clustering for fuzzy categorization of cursive handwritten text with weight learning in textual attributes
Author :
Sarker, Goutam ; Dhua, Silpi ; Besra, Monica
Author_Institution :
Dept. of CSE, NIT, Durgapur, India
fYear :
2015
fDate :
9-11 July 2015
Firstpage :
6
Lastpage :
11
Abstract :
A new method for fuzzy categorization of cursive handwritten text has been addressed in the present work. This is based on input text clustering and subsequent learning of weighted attributes in each subject cluster. The system first employs a new algorithm to detect the letter boundary in each cursive word in the textual sentences. A Modified Optimal Clustering Algorithm (MOCA) and Back Propagation (BP) Network combination converts the handwritten texts into printed ones. Subject wise grouping of printed texts are then made with Optimal Clustering Algorithm (OCA). The weighted attributes of each subject is thereafter learned to finally find out the fuzzy categorization of each input text. Different performance metrics of the system is computed with a newly introduced concept of Fuzzy Confusion Matrix. The performance evaluation of the fuzzy categorization of text with Holdout Method in terms of accuracy, precision, recall and f-score is appreciably high. Also, the learning and categorization time is quiet affordable.
Keywords :
backpropagation; fuzzy set theory; handwriting recognition; matrix algebra; pattern clustering; performance evaluation; text detection; BP network combination; MOCA; backpropagation network; categorization time; cursive handwritten text; cursive word; fuzzy categorization; fuzzy confusion matrix; holdout method; letter boundary; modified optimal clustering algorithm; performance evaluation; performance metrics; printed text; subsequent learning; text clustering; textual attribute; textual sentence; weight learning; weighted attribute; Accuracy; Clustering algorithms; Feature extraction; Image segmentation; Performance evaluation; Terminology; Training; Accuracy; BP Network; F-Score; Fuzzy Categorization; Fuzzy Clustering; Fuzzy Confusion Matrix; Holdout Method; MOCA; Machine Learning; OCA; Precision; Recall; Subject Glossary; Textual Attributes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Recent Trends in Information Systems (ReTIS), 2015 IEEE 2nd International Conference on
Conference_Location :
Kolkata
Type :
conf
DOI :
10.1109/ReTIS.2015.7232843
Filename :
7232843
Link To Document :
بازگشت