• DocumentCode
    1279149
  • Title

    A metapattern-based automated discovery loop for integrated data mining-unsupervised learning of relational patterns

  • Author

    Shen, Wei-Min ; Leng, Bing

  • Author_Institution
    Inf. Sci. Inst., Univ. of Southern California, Marina del Rey, CA, USA
  • Volume
    8
  • Issue
    6
  • fYear
    1996
  • fDate
    12/1/1996 12:00:00 AM
  • Firstpage
    898
  • Lastpage
    910
  • Abstract
    A metapattern (also known as a metaquery) is a new approach for integrated data mining systems. As opposed to a typical “toolbox”-like integration, where components must be picked and chosen by users without much help, metapatterns provide a common representation for inter-component communication as well as a human interface for hypothesis development and search control. One weakness of this approach, however, is that the task of generating fruitful metapatterns is still a heavy burden for human users. In this paper, we describe a metapattern generator and an integrated discovery loop that can automatically generate metapatterns. Experiments in both artificial and real-world databases have shown that this new system goes beyond the existing machine learning technologies, and can discover relational patterns without requiring humans to pre-label the data as positive or negative examples for some given target concepts. With this technology, future data mining systems could discover high-quality, human-comprehensible knowledge in a much more efficient and focused manner, and data mining could be managed easily by both expert and less-expert users
  • Keywords
    deductive databases; knowledge acquisition; query processing; relational databases; unsupervised learning; user interfaces; automated discovery loop; databases; deduction; high-quality human-comprehensible knowledge discovery; human interaction; human interface; hypothesis development; induction; integrated data mining systems; inter-component communication; metapattern generator; metaquery; relational patterns; search control; target concepts; unsupervised learning; Automatic control; Communication system control; Data analysis; Data mining; Humans; Knowledge management; Machine learning; Machine learning algorithms; Relational databases; Unsupervised learning;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.553157
  • Filename
    553157