• DocumentCode
    1697908
  • Title

    Image data mining from financial documents based on wavelet features

  • Author

    El Badawy, Ossama ; El-Sakka, Mahmoud R. ; Hassanein, Khaled ; Kamel, Mohamed S.

  • Author_Institution
    Dept. of Syst. Design Eng., Waterloo Univ., Ont., Canada
  • Volume
    1
  • fYear
    2001
  • fDate
    6/23/1905 12:00:00 AM
  • Firstpage
    1078
  • Abstract
    We present a framework for clustering and classifying cheque images according to their payee-line content. The features used in the clustering and classification processes are extracted from the wavelet domain by means of thresholding and counting of wavelet coefficients. The feasibility of this framework is tested on a database of 2620 cheque images. This database consists of cheques from 10 different accounts. Each account is written by a different person. Clustering and classification are performed separately on each account using distance-based techniques. We achieved correct-classification rates of 86% and 81% for the supervised and unsupervised learning cases, respectively. These rates are the average of correct-classification rates obtained from the 10 different accounts
  • Keywords
    banking; cheque processing; data mining; document image processing; feature extraction; handwritten character recognition; image classification; learning (artificial intelligence); pattern clustering; unsupervised learning; wavelet transforms; cheque image classifying; cheque image clustering; distance-based techniques; feature extraction; financial documents; handwritten text; image data mining; payee-line content; supervised learning; thresholding; unsupervised learning; wavelet features; Data mining; Design engineering; Discrete wavelet transforms; Feature extraction; Image databases; Pattern recognition; Spatial databases; Systems engineering and theory; Unsupervised learning; Wavelet domain;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 2001. Proceedings. 2001 International Conference on
  • Conference_Location
    Thessaloniki
  • Print_ISBN
    0-7803-6725-1
  • Type

    conf

  • DOI
    10.1109/ICIP.2001.959236
  • Filename
    959236