• DocumentCode
    1056413
  • Title

    Parallelism speeds data mining

  • Author

    Hedberg, Sara Reese

  • Author_Institution
    Issaquah, WA, USA
  • Volume
    3
  • Issue
    4
  • fYear
    1995
  • Firstpage
    3
  • Lastpage
    6
  • Abstract
    Increasingly, scientists and business people are mining mountains of stored data for valuable nuggets of insight using computerized data mining (also called knowledge discovery in databases, or KDD) techniques to learn more about customer behavior, discover new quasars, and catch crooks. Data mining recognizes patterns in data and predicts patterns from the data. Today´s data-mining techniques, sometimes called `siftware´, range from OLAP (online application processing) tools that query multidimensional databases, to various statistical techniques, to advanced artificial intelligence techniques like machine learning, neural networks, rule-based systems and genetic algorithms. To efficiently mine gigabytes and terabytes of data in a timely fashion, a growing number of organizations are turning to parallelism to speed up the processing. The US Department of Treasury, for example, has fielded a data-mining application that sifts through all large cash transactions reported by banks and casinos to detect potential money laundering. The application runs on a 6-processor Sun server
  • Keywords
    deductive databases; financial data processing; government data processing; knowledge acquisition; parallel processing; query processing; 6-processor Sun server; US Department of Treasury; artificial intelligence techniques; banks; cash transactions; casinos; data mining; data pattern recognition; genetic algorithms; knowledge discovery; machine learning; money laundering; multidimensional database querying; neural networks; online application processing tools; parallelism; pattern prediction; rule-based systems; siftware; statistical techniques; stored data; Application software; Artificial intelligence; Artificial neural networks; Data mining; Databases; Knowledge based systems; Machine learning; Multidimensional systems; Parallel processing; Pattern recognition;
  • fLanguage
    English
  • Journal_Title
    Parallel & Distributed Technology: Systems & Applications, IEEE
  • Publisher
    ieee
  • ISSN
    1063-6552
  • Type

    jour

  • DOI
    10.1109/88.473600
  • Filename
    473600