• DocumentCode
    3340016
  • Title

    Classification models for outbreak detection in oil and gas pollution area

  • Author

    Bakar, Afarulrazi Abu ; Idris, Norisma ; Hamdan, Abdul Razak ; Othman, Zulkifli ; Nazari, Mohd Zakri Ahmad ; Zainudin, S.

  • Author_Institution
    Center for Artificial Intell. Technol., Univ. Kebangsaan, Bangi, Malaysia
  • fYear
    2011
  • fDate
    17-19 July 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This study aim to investigate the data mining task and techniques specifically sequential pattern mining on the outbreak detection in oil and gas pollution area. The sequential pattern mining can be treated as a classification problem if enough data for certain sequence of time is available, as association problem if large number of related attributes are available, or can be seen as the deviation detection problem if the available data contain only few rare pattern or outliers. In this paper, the classification technique, decision tree is used for classification, and association rules mining is used for the outbreak detection task in oil and gas air dataset. The study found that unsupervised clustering using K-Means algorithm potentially obtain the rarely patterns of data distributing on several groups of pollutants and the average levels of supervised classification using the decision tree is a bit higher than the levels of association rules mining classification and appropriately used to classify the data by contaminants. Association rules mining on the other hand produce several sequences rules of contaminants. This study has high potential in producing quality rules for outbreak detection.
  • Keywords
    data mining; decision trees; environmental science computing; oil pollution; pattern classification; pattern clustering; pollution measurement; K-means algorithm; association rules mining classification; contaminants; data mining tasks; data mining techniques; decision tree; gas pollution outbreak detection; oil pollution outbreak detection; pattern classification; pollutants; sequential pattern mining; unsupervised clustering; Accuracy; Association rules; Classification algorithms; Diseases; Pollution; Training; Outbreak detection; association and sequential patterns; decision tree;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Engineering and Informatics (ICEEI), 2011 International Conference on
  • Conference_Location
    Bandung
  • ISSN
    2155-6822
  • Print_ISBN
    978-1-4577-0753-7
  • Type

    conf

  • DOI
    10.1109/ICEEI.2011.6021832
  • Filename
    6021832