• DocumentCode
    2742080
  • Title

    A Comparative Study for Outlier Detection Techniques in Data Mining

  • Author

    Bakar, Zuriana Abu ; Mohemad, Rosmayati ; Ahmad, Akbar ; Deris, Mustafa Mat

  • Author_Institution
    Dept. of Comput. Sci., Univ. Coll. of Sci. & Technol., Kuala Terengganu
  • fYear
    2006
  • fDate
    7-9 June 2006
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Existing studies in data mining mostly focus on finding patterns in large datasets and further using it for organizational decision making. However, finding such exceptions and outliers has not yet received as much attention in the data mining field as some other topics have, such as association rules, classification and clustering. Thus, this paper describes the performance of control chart, linear regression, and Manhattan distance techniques for outlier detection in data mining. Experimental studies show that outlier detection technique using control chart is better than the technique modeled from linear regression because the number of outlier data detected by control chart is smaller than linear regression. Further, experimental studies shows that Manhattan distance technique outperformed compared with the other techniques when the threshold values increased
  • Keywords
    data mining; regression analysis; Manhattan distance performance; control chart performance; data mining; linear regression performance; outlier detection; Association rules; Clustering algorithms; Computer science; Control charts; Data mining; Decision making; Educational institutions; Information technology; Intrusion detection; Linear regression; clustering; data mining; outlier;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cybernetics and Intelligent Systems, 2006 IEEE Conference on
  • Conference_Location
    Bangkok
  • Print_ISBN
    1-4244-0023-6
  • Type

    conf

  • DOI
    10.1109/ICCIS.2006.252287
  • Filename
    4017846