• DocumentCode
    2192342
  • Title

    Efficient Rule Generation for Dominant Class Problems on LARM

  • Author

    Fu, JuiHsi ; Lee, SingLing

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Chung Cheng Univ., Chiayi, Taiwan
  • fYear
    2010
  • fDate
    13-13 Dec. 2010
  • Firstpage
    570
  • Lastpage
    576
  • Abstract
    Lazy Associative Rule Mining (LARM) integrates lazy learning and Associative Rule Mining (ARM) to tailor label prediction results by generating related class associative rules (CARs) only when an unlabeled document comes. However, two main problems should be carefully concerned in LARM classification: (1) computing efficiency and (2) dominant class bias prediction. The main idea of the proposed method, LARM-DC, is to skip rule-inducing process in LARM so that the execution time could be greatly saved. Additionally, the confidences of LCARs are weighted to enhance rule importance on label prediction in order to correct the bias results. With regard to prediction accuracy, our experiments show that LARM-DC performs as well as LARM on balanced datasets, and gains significant improvement on imbalanced datasets. Moreover, classification efficiency is also greatly improved comparing with LARM that generally requires lots of CARs to be induced.
  • Keywords
    data mining; document handling; learning (artificial intelligence); pattern classification; CAR; LARM classification; class associative rule; dominant class bias prediction; dominant class problem; label prediction; lazy associative rule mining; lazy learning; rule generation; unlabeled document; Bias Prediction; Document Classification; Dominant Class Problems; Lazy Associative Rule Mining (LARM); Lazy Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2010 IEEE International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-1-4244-9244-2
  • Electronic_ISBN
    978-0-7695-4257-7
  • Type

    conf

  • DOI
    10.1109/ICDMW.2010.25
  • Filename
    5693348