• DocumentCode
    2889171
  • Title

    Database Encoding and An Anti-Apriori Algorithm for Association Rules Mining

  • Author

    Wang, Tong ; He, Pi-Lian

  • Author_Institution
    Vocational Technique Instruction Coll., Tianjin Univ.
  • fYear
    2006
  • fDate
    13-16 Aug. 2006
  • Firstpage
    1195
  • Lastpage
    1198
  • Abstract
    A method for encoding database is put forward in this paper. By this way, a record is denoted by only one binary number and so the size of the database is reduced sharply. If the database-encoding algorithm is used into some known modified algorithms, the efficiency will be improved remarkably. At the meantime, a new algorithm, anti-Apriori, which based on the proposed encoding method is introduced either. By using some properties of numbers, the itemsets of the database can be transformed into numerical fields. Different from the Apriori algorithm, the new one discovers the association rules from the largest frequent itemset at first, and then all sub itemset, which are also frequent, will be gained without any farther calculation, and all the other small frequent itemset that must be generated in the Apriori be omitted, and the times of the database scan is also reduced. Test results show the new algorithm based on the encoding database has a lower complexity of time and space
  • Keywords
    computational complexity; data mining; encoding; very large databases; antiApriori algorithm; association rules mining; database scanning; database-encoding algorithm; frequent itemset; Association rules; Computer science; Cybernetics; Data mining; Databases; Educational institutions; Encoding; Helium; Itemsets; Machine learning; Machine learning algorithms; Marketing and sales; Transaction databases; Anti-Apriori algorithm; association rules; data mining; database encoding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2006 International Conference on
  • Conference_Location
    Dalian, China
  • Print_ISBN
    1-4244-0061-9
  • Type

    conf

  • DOI
    10.1109/ICMLC.2006.258604
  • Filename
    4028245