Abstract :
Increasingly, scientists and business people are mining mountains of stored data for valuable nuggets of insight using computerized data mining (also called knowledge discovery in databases, or KDD) techniques to learn more about customer behavior, discover new quasars, and catch crooks. Data mining recognizes patterns in data and predicts patterns from the data. Today´s data-mining techniques, sometimes called `siftware´, range from OLAP (online application processing) tools that query multidimensional databases, to various statistical techniques, to advanced artificial intelligence techniques like machine learning, neural networks, rule-based systems and genetic algorithms. To efficiently mine gigabytes and terabytes of data in a timely fashion, a growing number of organizations are turning to parallelism to speed up the processing. The US Department of Treasury, for example, has fielded a data-mining application that sifts through all large cash transactions reported by banks and casinos to detect potential money laundering. The application runs on a 6-processor Sun server
Keywords :
deductive databases; financial data processing; government data processing; knowledge acquisition; parallel processing; query processing; 6-processor Sun server; US Department of Treasury; artificial intelligence techniques; banks; cash transactions; casinos; data mining; data pattern recognition; genetic algorithms; knowledge discovery; machine learning; money laundering; multidimensional database querying; neural networks; online application processing tools; parallelism; pattern prediction; rule-based systems; siftware; statistical techniques; stored data; Application software; Artificial intelligence; Artificial neural networks; Data mining; Databases; Knowledge based systems; Machine learning; Multidimensional systems; Parallel processing; Pattern recognition;