Title :
A methodology and life cycle model for data mining and knowledge discovery in precision agriculture
Author :
Lee, Seok Won ; Kerschberg, Larry
Author_Institution :
Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
Abstract :
Presents a methodology for data mining and knowledge discovery in large, distributed and heterogeneous databases. In order to obtain potentially interesting patterns, relationships and rules from such large and heterogeneous data collections, it is essential that a methodology be developed to take advantage of the suite of existing methods and tools that are available for data mining and knowledge discovery in databases (KDD). One of the most important methodologies is an integration of diverse learning strategies that cooperatively performs a variety of discovery techniques that achieves high-quality knowledge. The KDLC (knowledge discovery life-cycle) model is an extended study of AqBC, which is a multi-strategy knowledge discovery approach that combines supervised inductive rule learning and unsupervised Bayesian classification via a constructive induction mechanism. A case study dealing with crop yields for a farm in the state of Idaho, USA is presented, and preliminary results are visualized by using the ArcView geographical information system (GIS). The significance of the multi-strategy knowledge discovery and visualization process in analyzing the classifications and learned rules has been empirically verified in KDLC
Keywords :
agriculture; cooperative systems; data mining; data visualisation; distributed databases; learning by example; pattern classification; very large databases; AqBC; ArcView; GIS; Idaho, USA; KDLC; case study; classification analysis; constructive induction mechanism; cooperative discovery techniques; crop yields; data mining; farm; geographical information system; high-quality knowledge; knowledge discovery life-cycle model; large distributed heterogeneous databases; learned rules; learning strategies; multi-strategy knowledge discovery approach; precision agriculture; supervised inductive rule learning; unsupervised Bayesian classification; visualization process; Agriculture; Bayesian methods; Computer science; Data engineering; Data mining; Data visualization; Distributed databases; Knowledge engineering; Laboratories; Visual databases;
Conference_Titel :
Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
0-7803-4778-1
DOI :
10.1109/ICSMC.1998.725100