• DocumentCode
    153400
  • Title

    A New One-Class Classification Method Based on Symbolic Representation: Application to Document Classification

  • Author

    Alaei, Fahimeh ; Girard, N. ; Barrat, Sabine ; Ramel, Jean-Yves

  • Author_Institution
    Lab. d´Inf. (LI EA6300), Univ. Francois-Rabelais de Tours, Tours, France
  • fYear
    2014
  • fDate
    7-10 April 2014
  • Firstpage
    272
  • Lastpage
    276
  • Abstract
    Training a system using a small number of instances to obtain accurate recognition/classification is a crucial need in document classification domain. The one-class classification is chosen since only positive samples are available for the training. In this paper, a new one-class classification method based on symbolic representation method is proposed. Initially a set of features is extracted from the training set. A set of intervals valued symbolic feature vector is then used to represent the class. Each interval value (symbolic data) is computed using mean and standard deviation of the corresponding feature values. To evaluate the proposed one-class classification method a dataset composed of 544 document images was used. Experiment results reveal that the proposed one-class classification method works well even when the number of training samples is small (≤10). Moreover, we noted that the proposed one-class classification method is suitable for document classification and provides better result compared to one-class k-nearest neighbor (k-NN) classifier.
  • Keywords
    document image processing; feature extraction; image classification; document classification; feature extraction; intervals valued symbolic feature vector; k-NN classifier; mean deviation; one-class classification method; one-class k-nearest neighbor classifier; standard deviation; symbolic representation method; Data analysis; Feature extraction; Histograms; Standards; Support vector machines; Testing; Training; Document image classification; One-class classification; Symbolic data representation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
  • Conference_Location
    Tours
  • Print_ISBN
    978-1-4799-3243-6
  • Type

    conf

  • DOI
    10.1109/DAS.2014.77
  • Filename
    6831012