• DocumentCode
    3263858
  • Title

    Efficient integration of data mining techniques in database management systems

  • Author

    Bentayeb, Fadila ; Darmont, Jerome ; Udréa, Cédric

  • Author_Institution
    ERIC, Lyon Univ. 2, Bron, France
  • fYear
    2004
  • fDate
    7-9 July 2004
  • Firstpage
    59
  • Lastpage
    67
  • Abstract
    We propose a new approach for applying data mining techniques, and more particularly supervised machine learning algorithms, to large databases, in acceptable response times. This goal is achieved by integrating these algorithms within a database management system. We are thus only limited by disk capacity, and not by available main memory. However, the disk accesses that are necessary to scan the database induce long response times. Hence, we propose an original method to reduce the size of the learning set by building its contingency table. The machine learning algorithms are then adapted to operate on this contingency table. In order to validate our approach, we implemented the IDS decision tree construction method and showed that using the contingency table helped us obtaining response times equivalent to those of classical, in-memory software.
  • Keywords
    data mining; database management systems; decision trees; learning (artificial intelligence); IDS decision tree construction method; data mining; database management systems; disk capacity; supervised machine learning; Buildings; Data analysis; Data mining; Database systems; Decision trees; Delay; Machine learning algorithms; Memory management; Relational databases; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Engineering and Applications Symposium, 2004. IDEAS '04. Proceedings. International
  • ISSN
    1098-8068
  • Print_ISBN
    0-7695-2168-1
  • Type

    conf

  • DOI
    10.1109/IDEAS.2004.1319778
  • Filename
    1319778